From a08a2f89b6cdcb5d97a8b477ac9cdcab3d3e8908 Mon Sep 17 00:00:00 2001 From: Sergey Lebedev Date: Thu, 17 Oct 2024 09:53:26 +0000 Subject: [PATCH] DOCS: fix doxygen --- .github/workflows/docs.yaml | 2 +- docs/doxygen/collectives.md | 4 ++-- docs/doxygen/design.md | 6 +++--- docs/doxygen/events.md | 8 +++----- docs/doxygen/teams.md | 11 +++++------ docs/doxygen/ucc_doxygen.conf | 4 ++-- src/ucc/api/ucc.h | 7 +++++++ src/ucc/api/ucc_def.h | 4 ++-- 8 files changed, 25 insertions(+), 21 deletions(-) diff --git a/.github/workflows/docs.yaml b/.github/workflows/docs.yaml index c584be8c67..72dd1f3835 100644 --- a/.github/workflows/docs.yaml +++ b/.github/workflows/docs.yaml @@ -9,7 +9,7 @@ jobs: - name: Install dependencies run: | sudo apt-get update - sudo apt-get install -y --no-install-recommends doxygen doxygen-latex + sudo apt-get install -y --no-install-recommends doxygen doxygen-latex cm-super texlive-fonts-recommended - uses: actions/checkout@v1 - name: Build UCC docs run: | diff --git a/docs/doxygen/collectives.md b/docs/doxygen/collectives.md index d1f64b3be1..eb0140ebef 100644 --- a/docs/doxygen/collectives.md +++ b/docs/doxygen/collectives.md @@ -169,5 +169,5 @@ and it is returned to the user, when the collective is completed. \b The table below lists the necessary fields that user must initialize depending on the collective operation type. -\image latex ucc\_coll\_args\_table1.pdf width=\textwidth -\image latex ucc\_coll\_args\_table2.pdf width=\textwidth +\image latex "ucc_coll_args_table1.pdf" width=\textwidth +\image latex "ucc_coll_args_table2.pdf" width=\textwidth diff --git a/docs/doxygen/design.md b/docs/doxygen/design.md index 72fa701e3a..9cc7d9368f 100644 --- a/docs/doxygen/design.md +++ b/docs/doxygen/design.md @@ -1,4 +1,4 @@ -### Design +### Design * Highly scalable and performant collectives for HPC, AI/ML and I/O workloads * Nonblocking collective operations that cover a variety of programming models @@ -10,5 +10,5 @@ ### Component Diagram -\image latex ucc\_components.pdf "UCC Components and Usage" -\image html ucc\_components.png "UCC Components and Usage" +\image latex "ucc_components.pdf" "UCC Components and Usage" +\image html "ucc_components.png" "UCC Components and Usage" diff --git a/docs/doxygen/events.md b/docs/doxygen/events.md index 8986569682..e5d78eeba2 100644 --- a/docs/doxygen/events.md +++ b/docs/doxygen/events.md @@ -3,7 +3,7 @@ The execution engine is an execution context that supports event-driven network execution on the CUDA streams, CPU threads, and DPU threads. It is intended to interact with execution threads that are asynchronous (offloaded collective -execution) which can be implemented on GPUs, DPUs, or remote CPUs. +execution) which can be implemented on GPUs, DPUs, or remote CPUs. UCC supports triggering collective operations by library-generated and user-generated events. The library events are generated on posting or completion @@ -69,7 +69,5 @@ are related to the compute event. (5) On completion of the collective operation, the library posts UCC\_EVENT\_COLLECTIVE\_COMPLETE event to the completion event queue. -\image latex ucc\_events.pdf "UCC Execution Engine and Events" -\image html ucc\_events.png "UCC Execution Engine and Events" - - +\image latex "ucc_events.pdf" "UCC Execution Engine and Events" +\image html "ucc_events.png" "UCC Execution Engine and Events" diff --git a/docs/doxygen/teams.md b/docs/doxygen/teams.md index db331fe82b..fa6b8295fd 100644 --- a/docs/doxygen/teams.md +++ b/docs/doxygen/teams.md @@ -1,18 +1,18 @@ ## Teams -The ucc\_team\_h is a team handle, which encapsulates the resources required for group operations such as collective communication operations. The participants of the group operations can either be an OS process, a control thread or a task. +The ucc\_team\_h is a team handle, which encapsulates the resources required for group operations such as collective communication operations. The participants of the group operations can either be an OS process, a control thread or a task. -Create and destroy routines: ucc\_team\_create\_post routine is used to create the team handle and ucc\_team\_create\_test routine for learning the status of the create operation. The team handle is destroyed by the ucc\_team\_destroy operation. A team handle is customized using the user configured ucc\_team\_params\_t structure. +Create and destroy routines: ucc\_team\_create\_post routine is used to create the team handle and ucc\_team\_create\_test routine for learning the status of the create operation. The team handle is destroyed by the ucc\_team\_destroy operation. A team handle is customized using the user configured ucc\_team\_params\_t structure. \b Invocation semantics: The ucc\_team\_create\_post is a nonblocking collective operation, in which the participants are determined by the user-provided OOB collective operation. Overlapping of multiple ucc\_team\_create\_post operations are invalid. Posting a collective operation before the team handle is created is invalid. The team handle is destroyed by a blocking collective operation; the participants of this collective operation are the same as the create operation. When the user does not provide an OOB collective operation, all participants calling the ucc\_create\_post operation will be part of a new team created. -\b Communication Contexts: Each process or a thread participating in the team creation operation contributes one or more communication contexts to the operation. The number of contexts provided by all participants should be the same and each participant should provide the same type of context. The newly created team uses the context for collective operations. If the communication context abstracts the resources for the library, the collective operations on this team uses the resources provided by the context. +\b Communication Contexts: Each process or a thread participating in the team creation operation contributes one or more communication contexts to the operation. The number of contexts provided by all participants should be the same and each participant should provide the same type of context. The newly created team uses the context for collective operations. If the communication context abstracts the resources for the library, the collective operations on this team uses the resources provided by the context. -\b Endpoints: That participants to the ucc\_team\_create\_post operation can provide an endpoint, a 64-bit unsigned integer. The endpoint is an address for communication. Each participant of the team has a unique integer as endpoint .i.e., the participants of the team do not share the same endpoint. For example, the user can bind the endpoint to the parallel programming model’s index such as OpenSHMEM PE, an OS process ID, or a thread ID. The UCC implementation can use the endpoint as an index to identify the resources required for communication such as communication contexts. When the user does not provide the endpoint, the library generates the endpoint, which can be queried by the user. In addition to the endpoint, the user can provide information about the endpoints such as whether the endpoint is a continuous range or not. +\b Endpoints: That participants to the ucc\_team\_create\_post operation can provide an endpoint, a 64-bit unsigned integer. The endpoint is an address for communication. Each participant of the team has a unique integer as endpoint .i.e., the participants of the team do not share the same endpoint. For example, the user can bind the endpoint to the parallel programming model’s index such as OpenSHMEM PE, an OS process ID, or a thread ID. The UCC implementation can use the endpoint as an index to identify the resources required for communication such as communication contexts. When the user does not provide the endpoint, the library generates the endpoint, which can be queried by the user. In addition to the endpoint, the user can provide information about the endpoints such as whether the endpoint is a continuous range or not. \b Ordering: The collective operations on the team can either be ordered or unordered. In the ordered model, the UCC collectives are invoked in order .i.e., on a given team, each of the participants of the collective operation invokes the operation in the same order. In the unordered model, the collective operations are not necessarily invoked in the same order. -\b Interaction with Threads: The team can be created in either mode .i.e., the library initialized by UCC\_LIB\_THREAD\_MULTIPLE, UCC\_LIB\_THREAD\_SINGLE, or UCC\_LIB\_THREAD\_FUNNEDLED. In the UCC\_LIB\_THREAD\_MULTIPLE mode, each of the user threads can post a collective operation. However, it is not valid to post concurrent collectives operations from multiple threads to the same team. +\b Interaction with Threads: The team can be created in either mode .i.e., the library initialized by UCC\_LIB\_THREAD\_MULTIPLE, UCC\_LIB\_THREAD\_SINGLE, or UCC\_LIB\_THREAD\_FUNNEDLED. In the UCC\_LIB\_THREAD\_MULTIPLE mode, each of the user threads can post a collective operation. However, it is not valid to post concurrent collectives operations from multiple threads to the same team. \b Memory per Team: A team can be configured by a memory descriptor described by ucc\_mem\_map\_params\_t structure. The memory can be used as an input and output buffers for the collective operation. This is particularly useful for PGAS programming models, where the input and output buffers are defined before the invocation operation. For example, the input and output buffers in the OpenSHMEM programming model are defined during the programming model initialization. @@ -32,4 +32,3 @@ The newly created team shares the communication contexts with the parent team. T The split operation can be called by multiple threads, if the parent team to the split operations are different and if it agrees with the thread model of the UCC library. Notes: The rationale behind requiring all participants of the parent team to participate in the split operation is to avoid overlapping participants between multiple split operations, which is known to increase the implementation complexity. Also, currently, higher-level programming models do not require these semantics. - diff --git a/docs/doxygen/ucc_doxygen.conf b/docs/doxygen/ucc_doxygen.conf index 53c3b5346e..a4ec35b7b7 100644 --- a/docs/doxygen/ucc_doxygen.conf +++ b/docs/doxygen/ucc_doxygen.conf @@ -882,7 +882,7 @@ INPUT_ENCODING = UTF-8 # *.vhdl, *.ucf, *.qsf and *.ice. FILE_PATTERNS = *.h \ - *.md + *.md # The RECURSIVE tag can be used to specify whether or not subdirectories should @@ -1007,7 +1007,7 @@ FILTER_SOURCE_PATTERNS = # (index.html). This can be useful if you have a project on for instance GitHub # and want to reuse the introduction page also for the doxygen output. -USE_MDFILE_AS_MAINPAGE = main.md +USE_MDFILE_AS_MAINPAGE = $(SRCDIR)/docs/doxygen/main.md #--------------------------------------------------------------------------- # Configuration options related to source browsing diff --git a/src/ucc/api/ucc.h b/src/ucc/api/ucc.h index 38a6d1fd3f..3257c79896 100644 --- a/src/ucc/api/ucc.h +++ b/src/ucc/api/ucc.h @@ -50,6 +50,13 @@ BEGIN_C_DECLS * @} */ +/** + * @defgroup UCC_LIB_INTERNAL Internal library routines + * @{ + * Internal library routines + * @} + */ + /** * @defgroup UCC_CONTEXT_DT Context abstraction data-structures * @{ diff --git a/src/ucc/api/ucc_def.h b/src/ucc/api/ucc_def.h index 66005375be..edb792992c 100644 --- a/src/ucc/api/ucc_def.h +++ b/src/ucc/api/ucc_def.h @@ -151,7 +151,7 @@ typedef size_t ucc_context_addr_len_t; /** - * @ingroup UCC_EVENTS_DT + * @ingroup UCC_EVENT_DT * @brief UCC execution engine handle * * The UCC execution engine handle is an opaque handle created by the library representing @@ -161,7 +161,7 @@ typedef struct ucc_ee* ucc_ee_h; typedef struct ucc_dt_generic ucc_dt_generic_t; /** - * @ingroup UCC_DATATYPES + * @ingroup UCC_DATATYPE * @brief Helper enum for generic/predefined datatype representation * */