HDL_workflow.mdwn

   1 # HDL workflow
   2
   3 This section describes the workflow and some best practices for developing the LibreSoC hardware. We use nmigen, yosys and symbiyosys, and this page is intended not just to help you get set up, it is intended to help advise you of some tricks and practices that will help you become effective team contributors.
   4
   5 It is particularly important to bear in mind that we are not just "developing code", here: we are creating a "lasting legacy educational resource" for other people to learn from, and for businesses and students alike to be able to use, learn from and augment for their own purposes.
   6
   7 It is also important to appreciate and respect that we are funded under NLNet's Privacy and Enhanced Trust Programme <http://nlnet.nl/PET>. Full transparency, readability, documentation, effective team communication and formal mathematical proofs for all code at all levels is therefore paramount.
   8
   9 # Collaboration resources
  10
  11 ## Main contact method: mailing list
  12
  13 To respect the transparency requirements, conversations need to be public and archived (i.e not skype, not telegram, not discord, and anyone seriously suggesting slack will be thrown to the lions).  Therefore we have a
  14 mailing list. Everything goes through there. <http://lists.libre-riscv.org/mailman/listinfo/libre-riscv-dev> therefore please do google "mailing list etiquette" and at the very minimum look up and understand the following:
  15
  16 * This is a technical mailing list with complex topics. Top posting is completely inappropriate. Don't do it unless you have mitigating circumstances, and even then please apologise and explain ("hello sorry using phone at airport flight soon, v. quick reply: ....")
  17 * Always trim context but do not cut excessively to the point where people cannot follow the discussion.  Especially do not cut the attribution ("On monday xxx wrote")
  18 * Use inline replies i.e. reply at the point in the relevant part of the conversation, as if you were actually having a conversation.
  19 * Follow standard IETF reply formatting, using ">" for cascaded indentation of other people's replies.  If using gmail, please: SWITCH OFF RICH TEXT EDITING.
  20 * Please for god's sake do not use "my replies are in a different colour". Only old and highly regarded people still using AOL are allowed to get away with that (such as Mitch).
  21 * Start a new topic with a relevant subject line. If an existing discussion changes direction, change the subject line to reflect the new topic (or start a new conversation entirely, without using the "reply" button)
  22 * DMARC is a pain on the neck. Try to avoid GPG signed messages. sigh.
  23 * Don't send massive attachments. Put them online (no, not on facebook or google drive or anywhere else that demands privacy violations) and provide the link.  Which should not require any kind of login to access. ask the listadmin if you don't have anywhere suitable: FTP access can be arranged.
  24
  25 If discussions result in any actionable items, it is important not to lose track of them. Create a bugreport, find the discussion in the archives <http://lists.libre-riscv.org/pipermail/libre-riscv-dev/>, and put the link actually in the bugtracker as one of the comments.
  26
  27 At some point it may become better to use  <http://bugs.libre-riscv.org> itself to continue the discussion rather than to keep on dropping copies of links into the bugtracker.  The bugtracker sends copies of comments *to* the list however this is 'one-way' (note from lkcl: because this involves running an automated perl script from email, on every email, on the server, that is a high security risk, and i'm not doing it. sorry.)
  28
  29 Also, please do not use the mailing list as an "information or document store". We have the wiki for that.  Edit a page and tell people what you did (summarise rather than drop the entire contents at the list) and include the link to the page.
  30
  31 Or, if it is more appropriate, commit a document (or source code) into the relevant git repository then look up the link in the gitweb source tree browser and post that (in the bugtracker or mailing list) See <http://git.libre-riscv.org>
  32
  33 ## Bugtracker
  34
  35 bugzilla. old and highly effective. sign up in the usual way. any problems, ask on the list.
  36
  37 Please do not ask for the project to be transferred to github or other proprietary nonfree service "because it's soooo  convenient", as the lions are getting wind and gout from overfeeding.
  38
  39 ## ikiwiki
  40
  41 Runs the main libre-riscv.org site (including this page). effective, stunningly light on resources, and uses a git repository not a database.  That means it can be edited offline.
  42
  43 Usual deal: register an account and you can start editing and contributing straight away.
  44
  45 Assistance in creating a much better theme greatly appreciated.
  46
  47 ## git
  48
  49 we use git. more on this below.  we also use gitolite3 running on a dedicated server.  again, it is extremely effective and low resource utilisation.  reminder: lions are involved if github is mentioned.
  50
  51 gitweb is provided which does a decent job. <http://git.libre-riscv.org>
  52
  53 ## server
  54
  55 as an aside: all this is "old school" and run on a single core 512MB VM with only a 20GB HDD allocation. it costs only 8 GBP per month from mythic-beasts and means that the project is in no way dependent on anyone else - not microsoft, not google, not facebook, not amazon.
  56
  57 we tried gitlab. it didn't go well.
  58
  59 # Hardware
  60
  61 RAM is the biggest requirement. Minimum 16GB, the more the better (32 or 64GB starts to reach "acceptable" levels.  Disk space is not hugely critical: 256GB SSD should be more than adequate.  Simulations and FPGA compilations however are where raw processing power is a must.  High end Graphics Cards are nonessential.
  62
  63 What is particularly useful is to have hi-res screens (curved is *strongly* recommended if the LCD is over 24in wide, to avoid eyeballs going "prism" through longterm use), and to have several of them: the more the better.  Either a DisplayLink UD160A (or more modern variant) or simply using a second machine (lower spec hardware because it will run editors) is really effective.
  64
  65 Also it is really recommended to have a UHD monitor (4k - 3840x2160), or at least 2560x1200.  If given a choice, 4:3 aspect ratio is better than 16:9 particularly when using several of them. However, caveat (details below): please when editing do not assume that everyone will have access to such high resolution screens.
  66
  67 # Operating System
  68
  69 First install and become familiar with Debian (ubuntu if you absolutely must) for standardisation cross-team and so that toolchain installation is greatly simplified.  yosys in particular warns that trying to use Windows, *BSD or MacOS will get you into a world of pain.
  70
  71 Only a basic GUI desktop is necessary: fvwm2, xfce4, lxde are perfectly sufficient (alongside wicd-gtk for network management). Other more complex desktops can be used however may consume greater resources.
  72
  73 # editors and editing
  74
  75 Whilst this is often a personal choice, the fact that many editors are GUI based and run fullscreen with the entire right hand side *and* middle *and* the majority of the left side of the hi-res screen entirely unused and bereft of text leaves experienced developers both amused and puzzled.
  76
  77 At the point where such fullscreen users commit code with line lengths well over 160 characters, that amusement quickly evaporates.
  78
  79 Where the problems occur with fullscreen editor usage is when a project is split into dozens if not hundreds of small files (as this one is). At that point it becomes pretty much essential to have as many as six to eight files open *and on-screen* at once, without overlaps i.e. not in hidden tabs, next to at least two if not three additional free and clear terminals into which commands are regularly and routinely  typed (make, git commit, nosetests3 etc).
  80
  81 (hint/tip: fvwm2 set up with "mouse-over to raise focus, rather than additionally requiring a mouseclick, can save a huge amount of cumulative development time here, switching between editor terminal(s) and the command terminals).
  82
  83 Once this becomes necessary, it it turn implies that having greater than 80 chars per line - and running editors fullscreen - is a severe hindance to an essential *and highly effective* workflow technique.
  84
  85 Additionally, care should be taken to respect that not everyone will have 200+ column editor windows and the eyesight of a hawk. They may only have a 1280 x 800 laptop which barely fits two 80x53 xterms side by side.  Consequently, having excessively long functions is also a hindrance to others, as such developers with limited screen resources would need to continuously page-up and page-down to read the code even of a single function, in full.
  86
  87 This helps explain in part, below, why compliance with pep8 is enforced, including its 80 character limit.  In short: not everyone has the same "modern" GUI workflow or has access to the same computing resources as you, so please do respect that.
  88
  89 # Software prerequisites
  90
  91 Whilst many resources online advocate "sudo" in front of all root-level commands below, this quickly becomes tiresome. run "sudo bash", get a root prompt, and save yourself some typing.
  92
  93 * sudo bash
  94 * apt-get install vim exuberant-ctags
  95 * apt-get install build-essential
  96 * apt-get install git python3.7 python3.7-dev python-nosetest3
  97 * apt-get install graphviz xdot gtkwave
  98 * return to user prompt (ctrl-d)
  99
 100 This will get you python3 and other tools that are needed. graphviz is essential fir showing the interconnections between cells, and gtkwave is essential for debugging.
 101
 102 ## git
 103
 104 Look up good tutorials on how to use git effectively.  There are so many it is hard to recommend one. This is however essential. If you are not comfortable with git, and you let things stay that way, it will seriously impede development progress.
 105
 106 If working all day you should expect to be making at least two commits per hour, so should become familiar with it very quickly.  If you are *not* doing around 2 commits per hour, something is wrong and you should read the workflow instructions below more carefully, and also ask for advice on the mailing list.
 107
 108 ## yosys
 109
 110 Follow the source code (git clone) instructions here: <http://www.clifford.at/yosys/download.html>
 111
 112 Do not try to use a fixed revision (currently 0.9), nmigen is evolving and frequently interacts with yosys
 113
 114 ## symbiyosys
 115
 116 Follow the instructions here: <https://symbiyosys.readthedocs.io/en/latest/quickstart.html#installing>
 117
 118 You do not have to install all of those (avy, boolector can be left out if desired) however the more that are installed the more effective the formal proof scripts will be (less resource utilisation in certain circumstances).
 119
 120 ## nmigen
 121
 122 nmigen may be installed as follows:
 123
 124 * mkdir ~/src
 125 * cd !$
 126 * git clone https://github.com/m-labs/nmigen.git
 127 * cd nmigen
 128 * sudo bash
 129 * python3 setup.py develop
 130 * ctrl-d
 131
 132 testing can then be carried out with "python3 setp.py test"
 133
 134 ## Softfloat and sfpy
 135
 136 These are a test suite dependency for the ieee754fpu library, and will be changed in the future to use Jacob's algorithmic numeric library.  In the meantime the README describing the process is here:
 137 <https://git.libre-riscv.org/?p=ieee754fpu.git;a=blob;f=README.md;h=d219864a341e4b656680de476e385b6a7f70fb9b;hb=HEAD>
 138
 139 # Registering for git repository access
 140
 141 After going through the onboarding process and having agreed to take responsibility for certain tasks, ask on the mailing list for git repository access, sending in a public key (id_rsa.pub). If you do not have one then generate it with ssh-keygen -t rsa. You will find it in ~/.ssh
 142
 143 NEVER SEND ANYONE THE PRIVATE KEY.  By contrast the public key, on account of being public, is perfectly fine to make... err... public.
 144
 145 Create a file ~/.ssh/config with the following lines:
 146
 147     Host git.libre-riscv.org
 148     Port 922
 149
 150 Wait for the Project Admin to confirm that the ssh key has been added to the required repositories.  Once confirmed,
 151 you can clone any of the repos at http://git.libre-riscv.org:
 152
 153     git clone gitolite3@git.libre-riscv.org:REPONAME.git
 154
 155 # Checking out the HDL repositories
 156
 157 * mkdir ~/src
 158 * cd !$
 159 * git clone gitolite3@git.libre-riscv.org:soc.git
 160 * git clone gitolite3@git.libre-riscv.org:ieee754fpu.git
 161
 162 In each of these directories, track down the setup.py file, then, as root (sudo bash) run the following:
 163
 164 * python3 setup.py develop
 165
 166 The reason for using "develop" mode is that the code may be edited in-place yet still imported "globally".  There are variants on this theme for multi-user machine use however it is often just easier to get your own machine these days.
 167
 168 If "python3 setup.py install" is used it is a pain: edit, then install. edit, then install. It gets extremely tedious, hence why "develop" was created.
 169
 170 # Development Rules
 171
 172 team communication:
 173
 174 * communicate on the mailing list or the bugtracker an intent to take responsibility for a particular task.
 175 * assign yourself as the bug's owner
 176 * *keep in touch* about what you are doing, and why you are doing it.
 177 * if you cannot do something that you have taken responsibility for, then unless it is a dire personal emergency please say so, on-list. we won't mind. we'll help sort it out.
 178
 179 regarding the above it is important that you read, understand, and agree to the [[charter]] because the charter is about ensuring that we operate as an effective organisation.  It's *not* about setting rules and meting out punishment".
 180
 181 for actual code development:
 182
 183 * **do not commit autogenerated output**. write a shell script and commit that, or add a Makefile to run the command that generates the output, but **do not** add the actual output of **any** command to the repository.  ever.  this is really important.  even if it is a human-readable file rather than a binary object file.
 184 * if the command needed to create any given autogenerated output is not currently in the list of kniwn project dependencies, first consult on the list if it is okay to make that command become a hard dependency of the project (hint: java, node.js php and .NET commands may cause delays in response time due to other list participants laughing hysterically), and after a decision is made, document the dependency and how its source code is obtained and built (hence why it has to be discussed)
 185 * if you find yourself repeating commands regularly, chances are high that someone else will need to run them, too. therefore, put them into a .sh shell script (and/or a Makefile) and document them at the very minimum in README or INSTALL.txt or somewhere in a docs folder as appopriate.  if unsure, ask on the mailing list for advice.
 186 * plan in advance to write not just code but a full test suite for that code.  **this is not optional**. large python projects that do not have unit tests **FAIL**.
 187 * edit files making minimal *single purpose* modifications (even if it involves multiple files. Good extreme example: globally changing a function name across an entire codebase is one purpose, one commit, yet hundreds of files).
 188 * prior to committing make sure that relevant unit tests pass, or that the change is a zero-impact addition.
 189 * commit no more than 5 to 10 lines at a time, with a CLEAR message (no "added this" or "changed that").
 190 * if as you write you find that the commit message involves a *list* of changes or the word "and", then STOP. do not proceed: it is a "red flag" that the commit has not been properly broken down into separate-purpose commits. ask for advice on-list.
 191 * if it is essential to commit large amounts of code, ensure that it is **not** in use **anywhere** by any other code.
 192
 193 the reason for the above is because python is a weakly typed language.  make one tiny change at the base level of the class hierarchy and the effect may be disastrous.
 194
 195 therefore, if you are absolutely certain that a new addition (new file, new class, new function) is not going to have any side-effects, committing it (a large amount of code) is perfectly fine.
 196
 197 as a general rule, however, do not use this an an excuse to write code first then write unit tests as an afterthought.  write *less* code *in conjunction* with its (more basic) unit tests, instead.
 198
 199 the reason for separating out commits to single purpose only becomes obvious (and regretted if not followed) when, months later, a mistake has to be tracked down and reverted.  if the commit does not have an easy-to-find message, it cannot even be located, and once found, if the commit confuses several unrelated changes, not only the diff is larger than it should be, the reversion process becomes extremely painful.
 200
 201 * all code needs to conform to pep8.  use either pep8checker or better run autopep8.  however whenever committing whitespace changes, *make a separate commit* with a commit message "whitespace" or "autopep8 cleanup".
 202 * pep8 REQUIRES no more than 80 chars per line. this is non-negotiable. if you think you need greater than 80 chars, it *fundamentally* indicates poor code design. split the code down further into smaller classes and functions.
 203 * TBD there is a docstring checker.  at the minimum make sure to have an SPD license header, module header docstring, class docstring and function docstrings on at least non-obvious functions.
 204 * make liberal but not excessive use of comments.  describe a group of lines of code, with terse but useful comments describing the purpose, documenting any side-effects, and anything that could trip you or other developers up.  unusual coding techniques should *definitely* contain a warning.
 205 * unless they are very closely related, only have one module (one class) per file. a file only 25 lines long including imports and docstrings is perfectly fine however don't force yourself. again, if unsure, ask on-list.
 206 * *keep files short and simple*. see below as to why
 207 * create a decent directory hierarchy but do not go mad. ask for advice if unsure
 208 * please do not use "from module import *". it is extremely bad practice, causes unnecessary resource utilisation, makes code readability extremely difficult, and results in unintended side-effects.
 209 * try to keep both filenames and variable names short but not ridiculously obtuse. an interesting compromise on imports is "from ridiculousfilename import longsillyname as lsn", and to assign variables as well: "comb = m.d.comb" followed by multiple "comb += nmigen_stmt" lines is a good trick that can reduce code indentation by 6 characters without reducing clarity.
 210
 211 regarding code structure: we decided to go with small modules that are both easy to analyse, as well as fit onto a single page and be readable when displayed as a visual graph on a full UHD monitor.  this is done as follows:
 212
 213 * using the capability of nmigen (TODO crossref to example) output the module to a yosys ilang (.il) file
 214 * in a separate terminal window, run yosys
 215 * at the yosys prompt type "read_ilang modulename.il"
 216 * type "show top" and a graphviz window should appear. note that typing show, then space, then pressing the tab key twice will give a full list of submodules (one of which will be "top")
 217
 218 you can now fullsize the graphviz window and scroll around.  if it looks reasonably obvious at 100% zoom, i.e the connections can be clearly related in your mind back to the actual code (by matching the graph names against signals and modules in the original nmigen code) and the words are not tiny when zoomed out, and connections are not total incomprehensible spaghetti, then congratulations, you have well-designed code. If not, then this indicates a need to split the code further into submodules and do a bit more work.
 219
 220 The reasons for doing a proper modularisation job are several-fold:
 221
 222 * firstly, we will not be doing a full automated layout-and-hope using alliance/coriolis2, we will be doing leaf-node thru tree node half-automated half-manual layout, finally getting to the floorplan, then revising and iteratively adjusting.
 223 * secondly, examining modules at the gate level (or close to it) is just good practice.  poor design creeps in by *not* knowing what the tools are actually doing (word to experienced developers: yes, we know that the yosys graph != final netlist).
 224 * thirdly, unit testing, particularly formal proofs, is far easier on small sections of code, and complete in a reasonable time.
 225
 226 # TODO Tutorials
 227
 228 Find appropriate tutorials for nmigen and yosys, as well as symbiyosys.
 229
 230 * Although a verilog example this is very useful to do <https://symbiyosys.readthedocs.io/en/latest/quickstart.html#first-step-a-simple-bmc-example>
 231 * This tutorial looks pretty good and will get you started <http://blog.lambdaconcept.com/doku.php?id=nmigen:nmigen_install> and walks not just through simulation, it takes you through using gtkwave as well.
 232 * There exist several nmigen examples which are also executable <https://github.com/m-labs/nmigen/tree/master/examples/> exactly as described in the above tutorial (python3 filename.py -h)