Difference between revisions of "VPP/Documentation"
(describe how to document functions) |
m (→Describing functions and their parameters) |
||
Line 101: | Line 101: | ||
Things to note: | Things to note: | ||
− | * The <code>@brief</code> is exactly that: brief. It is a one-sentence description of the purpose of the function. In particular note that it ends with a period '.' | + | * The <code>@brief</code> is exactly that: brief. It is a one-sentence description of the purpose of the function. In particular note that it ends with a period '.'. This is important since that is what ends a sentence; otherwise the text from the next paragraph would run-on as part of the brief description when Doxygen parses it. It is also possible to dispense with the <code>@brief</code> command; when Doxygen notices it is missing it will use the first sentence of the long description for the brief sentence instead. |
* The <code>@param</code> commands do not provide extra formatting (like hyphens '-') nor do they repeat the type information from the signature (since that would be redundant and often leads to errors. | * The <code>@param</code> commands do not provide extra formatting (like hyphens '-') nor do they repeat the type information from the signature (since that would be redundant and often leads to errors. | ||
** Doxygen automatically includes type information in the function signature section | ** Doxygen automatically includes type information in the function signature section | ||
** There are some places that do include type information, including the boiler plate node documentation template, but only because that is an apt way to describe the parameter. It is not encouraged. | ** There are some places that do include type information, including the boiler plate node documentation template, but only because that is an apt way to describe the parameter. It is not encouraged. | ||
** It's important to remember that this information is captured semantically for use by other tools, not just HTML page rendering. Use only Doxygen formatting commands if formatting is needed. | ** It's important to remember that this information is captured semantically for use by other tools, not just HTML page rendering. Use only Doxygen formatting commands if formatting is needed. | ||
+ | ** The start of parameter descriptions has been horizontally aligned to aid readability of the source. | ||
* The <code>@returns</code> (or <code>@return</code>) also does not include type information. | * The <code>@returns</code> (or <code>@return</code>) also does not include type information. | ||
** Often just one <code>@returns</code> is sufficient but it is also valid to use more than one if that reads better, as in this example. | ** Often just one <code>@returns</code> is sufficient but it is also valid to use more than one if that reads better, as in this example. |
Revision as of 13:32, 31 August 2016
The VPP project currently uses Doxygen as the mechanism to generate documentation. This is predominantly focused on providing developer-focused information but it can also be used to generate user-focused details.
Doxygen works by parsing source files and identifying special comment blocks that are adjacent to identifiers in the code. We have Doxygen configured to generate output for items that are not yet documented so that at the very least the names of function parameters or structure members are visible.
This wiki page aims to cover how developers can go about producing documentation. Whilst it will cover a number of Doxygen features, readers are encouraged to review the Doxygen documentation and in particular special commands page.
Lacking an official home for the generated documentation, at this time master
branch is processed every six hours and visible at https://vpp.flirble.org/master/. The most recent stable release is also available at https://vpp.flirble.org/stable/.
Documenting the code
Language
The language of written documentation in VPP is English. It is tempting, as developers, to write tersely with the belief that doing so will adequately convey the relevant details. This should be avoided since readers of the resulting documentation may find the text halting and lacking context. This is especially true if English is not the readers' native tongue.
As a consequence documentation should be written with structurally correct English complete with capitalization, sentences and paragraphs where appropriate. Assumptions regarding context should be avoided. Spelling mistakes and grammar are easily fixed later; structure and context are much harder to fill in.
What to document
In an ideal world we would document everything; however given the realities of life and open source projects here are a list of items that should be prioritized for documenting and where the better the documentation the better able VPP is to mature:
- Graph node functions.
- See the graph node template below.
- Widely-used library functions, structures and pre-processor macros.
- Most of
vppinfra
andvlib
. - Node and feature lookup, insertion or other manipulation functions.
-
*_format
and*_unformat
functions. - FIB and other centralized data store and lookup mechanisms.
- Most of
-
*_main_t
and other primary data structures used for interacting with modules. - CLI
short_help
andlong_help
.- These will eventually be extracted for automatic documentation.
- API definitions.
- At the moment most API documentation just describes the parameters; better description of what API calls do is necessary in many cases.
- Anything else that may be of use to the developer who comes after you.
In general, static
and inline
functions within modules are not visible outside that module and thus adding a documentation block may be of limited value (with exceptions, such as when used for function pointers for consumption elsewhere); though this does not excuse a developer from writing useful comments. However static and inline functions in header files are much more widely visible and should be documented.
In general, even though the purpose of an item may be obvious from its name, when rendered as documentation this implicit meaning is not always forthcoming. Thus it is recommended that such items not be skipped from consideration merely because the meaning seems plain. For example, structures (and unions) require documentation describing both the structure itself as well as all of its members. Similarly enumerations should have documentation both for the enumeration as a whole as well as individual items defined by it. It is very apparent in the generated documentation when developers only partially document these things.
Global variables must be documented. Since global variables are evil but are sometimes necessary we should encourage developers to justify their existence, to apologize to future programmers and beg their forbearance for the indiscretion.
Documentation conventions
To promote consistency both in the VPP source code and in the generated text, these are the conventions VPP have adopted for writing documentation.
Doxygen comments/commands
Doxygen requires the use of specially marked comment blocks to identify documentation and recognizes several special commands inside those blocks. It provides several ways to indicate both of these things. Whilst any will work, VPP as a project has adopted the following as conventions:
- Pre-identifier comment blocks are signified with
/** ... */
.- These are documentation blocks that come immediately before the code that they provide documentation for.
- Post-identifier comment blocks are given with
/**< ... */
.- These are documentation blocks that come immediately after the code that they provide documentation for.
- In some situations using
///< ...
may make the code more readable and this is acceptable.
- Special commands are prefixed with the at-sign, for example
@param
or@brief
.- Though you may often find the use of a back-slash in the existing code (especially
\brief
) these are being migrated to using at-sign. - Note that the Doxygen documentation will show use of back-slash throughout, but it notes that the two symbols are equal and interchangeable.
- Though you may often find the use of a back-slash in the existing code (especially
Referring to identifiers in the text
Doxygen will automatically spot the names of known functions in the text and link those to the definition of that function. For other identifiers you must tell it to using @ref
and this is encouraged in all places; note that not all things are linkable in this way (refer to the Doxygen documentation) but it is harmless to use @ref
in such places.
For other items, such as function parameters, local variables or constants the use of @c
is recommended. This will mono-space the text thus providing contrast with the surrounding text. Note that this works only for a single word; <tt> ... </tt>
can be used for multiple words or @code ... @endcode
for a C code block.
Describing functions and their parameters
Functions are typically documented with a docblock immediately before the function definition. They contain a brief description, a fuller description, a description of each parameter and information on any return values.
This is straightforward to use, like so:
/** * @brief A widget coloring function. * * This is the function for changing the color of widgets. * * @note It is possible to * * @param widget A reference to the widget to operate on. * @param type The type of operation. The widget will be obliterated * if this value is prime. Colors will be opaque if * the value is narcissistic. Transparency can be determined * by rotating the number 90 degrees. Any other value will * change its color. Negative values have undesirable effects * if you are afraid of spiders. * * @returns 42 on good days. * @returns Random numbers otherwise. */ static word widget_colorizer (widget_t *widget, word value) { ... return 42; }
Things to note:
- The
@brief
is exactly that: brief. It is a one-sentence description of the purpose of the function. In particular note that it ends with a period '.'. This is important since that is what ends a sentence; otherwise the text from the next paragraph would run-on as part of the brief description when Doxygen parses it. It is also possible to dispense with the@brief
command; when Doxygen notices it is missing it will use the first sentence of the long description for the brief sentence instead. - The
@param
commands do not provide extra formatting (like hyphens '-') nor do they repeat the type information from the signature (since that would be redundant and often leads to errors.- Doxygen automatically includes type information in the function signature section
- There are some places that do include type information, including the boiler plate node documentation template, but only because that is an apt way to describe the parameter. It is not encouraged.
- It's important to remember that this information is captured semantically for use by other tools, not just HTML page rendering. Use only Doxygen formatting commands if formatting is needed.
- The start of parameter descriptions has been horizontally aligned to aid readability of the source.
- The
@returns
(or@return
) also does not include type information.- Often just one
@returns
is sufficient but it is also valid to use more than one if that reads better, as in this example.
- Often just one
Otherwise common-sense should prevail. Describe what a function does, what its side-effects are and how it should be used. For more complex functions try to include examples of how to use it.
Describing function-like macros and their parameters
Macros can be documented just like functions with one significant difference: Doxygen cannot guess the type of each parameter or any return value and thus the documentation must mention parameter typing if it is important.
It's also important to mention the side-effects a macro may have or other local variables that must be present for it to work.
Extra commands
These commands have been added to Doxygen (using the ALIAS
mechanism) to assist with documenting specific items in a consistent way.
@node
: Indexing graph node identifiers
The @node
command is used to document the name of graph nodes. The command makes use of the Doxygen @xrefitem
command to generate an index of graph nodes and renders the name of the node in mono-spaced type within the node documentation.
See below in the node template section for an example of how this is used and to see now the node name is rendered within the documentation output. The index, when rendered, will look something like this:
Documentation templates
Source code file header
All of our source code files are required to include the Apache 2.0 license preamble. To properly document files they should also contain at minimum a brief description of the contents of that module and preferably a longer description to give some detail; for example, describing how to interact with it, the key functionality it provides and any special items of note.
/* * Copyright (c) <current-year> <your-affiliation-here>. * * Licensed under the Apache License, Version 2.0 (the "License"); * you may not use this file except in compliance with the License. * You may obtain a copy of the License at: * * http://www.apache.org/licenses/LICENSE-2.0 * * Unless required by applicable law or agreed to in writing, software * distributed under the License is distributed on an "AS IS" BASIS, * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. * See the License for the specific language governing permissions and * limitations under the License. */ /** * @file * @brief Your brief description goes here. * * Your much longer description goes here. This should be written in * structurally correct English, with proper capitalization, sentences * and paragraphs. */
An example of the @file
section might read:
/**
* @file
* @brief Unix stdin/socket command line interface.
*
* Provides a command line interface so humans can interact with VPP.
* This is predominantly a debugging and testing mechanism.
*/
Pay careful attention that the @file
directive is provided as-is. It's tempting to add a filename after this directive but please do not do this, it only confuses Doxygen.
Directory description
So that directories are appropriately described in the file browser each directory with source code in should have a file with the name dir.dox
. At minimum that file should contain a brief description of the contents of the directory but it could also contain a much longer description.
Refer to the file doxygen/dir.dox.sample
:
/* * Copyright (c) <current-year> <your-affiliation-here>. * * Licensed under the Apache License, Version 2.0 (the "License"); * you may not use this file except in compliance with the License. * You may obtain a copy of the License at: * * http://www.apache.org/licenses/LICENSE-2.0 * * Unless required by applicable law or agreed to in writing, software * distributed under the License is distributed on an "AS IS" BASIS, * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. * See the License for the specific language governing permissions and * limitations under the License. */ /** @dir @brief Someone please fix this description @todo This directory needs a description. This is where you would document the contents of a directory. This looks like a C file but it is not part of the build; it is purely for documentation. */
Pay careful attention to the @dir
directive; it must be provided as-is. It is tempting to provide a path after it but please do not do this; it often confuses Doxygen.
Documenting a graph node function
Nodes form the core of what VPP does and providing meaningful documentation about their function is important. This is doubly so for those nodes which are direct ancestors or descendants of a node a developer is working; that individual will be looking for details on the features the surrounding nodes make available. Examples may include the metadata passed along in the buffer opaque data which will help a node understand where a packet came from. Some nodes will expect data in the opaque section in order to steer their behavior.
By way of example, here is the documentation block for the "ip4-lookup
" node and which may be used as a template:
/** * @brief IPv4 lookup node. * @node ip4-lookup * * This is the main IPv4 lookup dispatch node. * * @param vm vlib_main_t corresponding to the current thread. * @param node vlib_node_runtime_t data for this node. * @param frame vlib_frame_t whose contents should be dispatched. * * @par Graph mechanics: buffer metadata, next index usage * <em>Uses:<em> * - <code>vnet_buffer(b)->sw_if_index[VLIB_RX]</code> * - Indicates the @c sw_if_index value of the interface that the * packet was received on. * - <code>vnet_buffer(b)->sw_if_index[VLIB_TX]</code> * - When the value is @c ~0 then the node performs a longest prefix * match (LPM) for the packet destination address in the FIB attached * to the receive interface. * - Otherwise perform LPM for the packet destination address in the * indicated FIB. In this case <code>[VLIB_TX]</code> is a FIB index * value (0, 1, ...) and not a VRF id. * * <em>Sets:<em> * - <code>vnet_buffer(b)->ip.adj_index[VLIB_TX]</code> * - The lookup result adjacency index. * * <em>Next Index:</em> * - Dispatches the packet to the node index found in * ip_adjacency_t @c adj->lookup_next_index * (where @c adj is the lookup result adjacency). */ static uword ip4_lookup (vlib_main_t * vm, vlib_node_runtime_t * node, vlib_frame_t * frame) { ... }
When rendered, this documentation block will look something like this:
Of particular note this example describes these things:
- The node name is documented with the special "
@node
" command. - A description of what extra data it uses as inputs; in this case the text describes how
ip4-lookup
makes use ofvnet_buffer(b)‑>sw_if_index[VLIB_RX]
andvnet_buffer(b)‑>sw_if_index[VLIB_TX]
. - A description of what extra data is passed on to descendants; in this case
vnet_buffer(b)‑>sw_if_index[VLIB_TX]
. - A description of how the next node index is determined.
In this case the bulk of the node functionality can be described in terms of the sw_if_index
sections; other nodes may need to provide more verbiage in other sections of the text. In particular, where nodes have other side-effects those should be clearly documented.
Build tools
All documentation generation related dependencies are installed with the install-dep
target of the top level Makefile.
Building the documentation
The top level Makefile of the VPP build tree also includes a doxygen
build target that will run Doxygen over the complete VPP source tree. Invoking that target will produce a significant amount of output and would look something like this:
$ make doxygen Adding custom extension mapping: .def will be treated as language c Adding custom extension mapping: .api will be treated as language c Parsing layout file /home/chrisy/dev/fdio/vpp/doxygen/layout.xml... Searching for include files... Searching for files in directory /home/chrisy/dev/fdio/vpp/vppinfra Searching for files in directory /home/chrisy/dev/fdio/vpp/vppinfra/autom4te.cache Searching for files in directory /home/chrisy/dev/fdio/vpp/vppinfra/config ... Patching output file 10218/10219 Patching output file 10219/10219 lookup cache used 25684/65536 hits=502842 misses=26575 finished...
Previewing the generated documentation
By default only HTML documentation is produced and it is built in build-root/docs/html
. If you build the documentation on a workstation then you can simply open index.html
in that directory with a web browser to view it, for example:
$ make doxygen ... $ sensible-browser build-root/docs/html/index.html
Alternatively if you generate your documentation on a remote machine then an approach might be to add an HTTP server (such as Nginx or Apache HTTPD) to that host and configure it to expose the build-root/docs/html
directory; one mechanism is to symbolically link to this directory from inside some other directory already visible from the HTTP server, assuming the use of symbolic links is allowed. Details on this are beyond the scope of this page.
Building documentation for a specific directory or file
Producing the documentation for the entire build tree can take some time which can become an issue if a developer is working on documentation for a specific module or directory tree. As a convenience it is possible to override the set of directories into which Doxygen will descend when searching for files to parse. For example:
$ make doxygen DOXY_INPUT=vppinfra
or, if you wanted just one file:
$ make doxygen DOXY_INPUT=vppinfra/vppinfra/pool.h
Cleaning up after Doxygen
It is also possible to clean up the Doxygen output directory. This can become useful since Doxygen does not itself remove files from the build directory on subsequent runs should they become redundant.
$ make wipe-doxygen
This is especially worth noting should you have previously built the documentation for the whole source tree and then want to build it for only one directory or file (using DOXY_INPUT
); whilst unlikely, the remnants of previous documentation may interfere with subsequent Doxygen runs for a narrower scope.