minor changes for JOSS publication (#955)

danielskatz · tpadioleau · tpadioleau · commit 113f78bf764c · 2025-10-31T13:46:36.000+01:00
* Update paper.md

* Update paper.bib

---------

Co-authored-by: Thomas Padioleau <thomas.padioleau@cea.fr>
diff --git a/paper/paper.bib b/paper/paper.bib
@@ -73,7 +73,7 @@ @article{hoyer2017xarray
 }
 
 @article{grandgirard2016,
-title = {A 5D gyrokinetic full-f global semi-Lagrangian code for flux-driven ion turbulence simulations},
+title = {A 5D gyrokinetic full-f global semi-{L}agrangian code for flux-driven ion turbulence simulations},
 journal = {Computer Physics Communications},
 volume = {207},
 pages = {35-68},
diff --git a/paper/paper.md b/paper/paper.md
@@ -1,35 +1,25 @@
-# DDC: The Discrete Domain Computation library
-
 ---
 title: 'DDC: The Discrete Domain Computation library'
 tags:
-
 - C++
 - HPC
 - labelled arrays
 - xarray
-
 authors:
-
 - name: Thomas Padioleau
   orcid: 0000-0001-5496-0013
   equal-contrib: true
   affiliation: 1
-
 - name: Julien Bigot
   orcid: 0000-0002-0015-4304
   affiliation: 1
-
 - name: Emily Bourne
   orcid: 0000-0002-3469-2338
   affiliation: 2
-
 - name: Baptiste Legouix
   orcid: 0009-0006-7585-669X
   affiliation: 3
-
 affiliations:
-
 - name: Université Paris-Saclay, UVSQ, CNRS, CEA, Maison de la Simulation, 91191, Gif-sur-Yvette, France
   index: 1
 - name: SCITAS, EPFL, CH-1015 Lausanne, Switzerland
@@ -38,7 +28,6 @@ affiliations:
   index: 3
 date: 31 October 2025
 bibliography: paper.bib
-
 ---
 
 ## Summary
@@ -51,11 +40,11 @@ The use of multidimensional arrays is widespread across various fields, particul
 
 Many programming languages commonly used in scientific computing support multidimensional arrays in different ways. Fortran, a longstanding choice in the field, and Julia, a more recent language, both natively support these data structures. In contrast, the Python ecosystem relies on the popular NumPy library’s `numpy.Array` [@harris2020array]. Meanwhile, C++23 introduced `std::mdspan` to the standard library. This container was inspired by `Kokkos::View` from the Kokkos library which also serves as the foundation of DDC.
 
-Despite their importance, multidimensional arrays introduce several practical challenges. In a sense, they encourage the usage of implicit information in the source code. A frequent source of errors is the inadvertent swapping of indices when accessing elements. Such errors can be difficult to detect, especially given the common convention of using single-letter variable names like `i` and `j` for indexing. Another challenge in medium to large codebases is the lack of semantic clarity in function signatures when using raw multidimensional arrays. When array dimensions carry specific meanings, this information is not explicitly represented in the source code, leaving it up to the user to ensure that dimensions are ordered correctly according to implicit expectations. For example it is quite usual to use the same index for multiple interpretations: looping over mesh cells identified by `i` and interpreting `i+1` as the face to the right. Another example is slicing that removes dimensions, this can shift the positions of remaining dimensions, altering the correspondence between axis indices and their semantic meanings.
+Despite their importance, multidimensional arrays introduce several practical challenges. In a sense, they encourage the usage of implicit information in the source code. A frequent source of errors is the inadvertent swapping of indices when accessing elements. Such errors can be difficult to detect, especially given the common convention of using single-letter variable names like `i` and `j` for indexing. Another challenge in medium-to-large codebases is the lack of semantic clarity in function signatures when using raw multidimensional arrays. When array dimensions carry specific meanings, this information is not explicitly represented in the source code, leaving it up to the user to ensure that dimensions are ordered correctly according to implicit expectations. For example it is quite usual to use the same index for multiple interpretations: looping over mesh cells identified by `i` and interpreting `i+1` as the face to the right. Another example is slicing that removes dimensions, this can shift the positions of remaining dimensions, altering the correspondence between axis indices and their semantic meanings.
 
-Solutions have been proposed to address these issues. For example in Python, the Xarray [@hoyer2017xarray] library allows users to label dimensions that can then be used to perform computations. Following a similar approach, the "Discrete Domain Computation" (DDC) library aims to bring equivalent functionality to the C++ ecosystem. It uses a zero overhead abstraction approach, i.e. with labels fixed at compile-time, on top of different performant portable libraries, such as: Kokkos [@9485033, @9502936], Kokkos Kernels [@rajamanickam2021kokkos], kokkos-fft [@kokkos-fft] and Ginkgo [@GinkgoJoss2020]. Labelling at compile time is achieved by strongly typing dimensions, an approach similar to that used in units libraries such as mp-units [@Pusz_mp-units_2024], which strongly type quantities rather than dimensions.
+Solutions have been proposed to address these issues. For example, in Python, the Xarray [@hoyer2017xarray] library allows users to label dimensions that can then be used to perform computations. Following a similar approach, the "Discrete Domain Computation" (DDC) library aims to bring equivalent functionality to the C++ ecosystem. It uses a zero overhead abstraction approach, i.e., with labels fixed at compile-time, on top of different performant portable libraries, such as Kokkos [@9485033, @9502936], Kokkos Kernels [@rajamanickam2021kokkos], kokkos-fft [@kokkos-fft], and Ginkgo [@GinkgoJoss2020]. Labelling at compile time is achieved by strongly typing dimensions, an approach similar to that used in units libraries such as mp-units [@Pusz_mp-units_2024], which strongly type quantities rather than dimensions.
 
-The library is actively used to modernize the Fortran-based Gysela plasma simulation code [@Bourne_Gyselalib]. This simulation code relies heavily on high-dimensional arrays. While the data stored in the arrays has 7 dimensions, each dimension can have multiple representations, including Fourier, spline, Cartesian, and various curvilinear meshes. The legacy Fortran implementation, used to manipulate multi-dimensional arrays that stored slices of all the possible dimensions with very limited information about which dimensions were actually represented to enforce correctness at the API level. DDC enables a more explicit, strongly-typed representation of these arrays, ensuring at compile-time that function calls respect the expected dimensions. This reduces indexing errors and improves code maintainability, particularly in large-scale scientific software.
+The library is actively used to modernize the Fortran-based Gysela plasma simulation code [@Bourne_Gyselalib]. This simulation code relies heavily on high-dimensional arrays. While the data stored in the arrays has 7 dimensions, each dimension can have multiple representations, including Fourier, spline, Cartesian, and various curvilinear meshes. The legacy Fortran implementation was used to manipulate multi-dimensional arrays that stored slices of all the possible dimensions with very limited information about which dimensions were actually represented to enforce correctness at the API level. DDC enables a more explicit, strongly-typed representation of these arrays, ensuring at compile-time that function calls respect the expected dimensions. This reduces indexing errors and improves code maintainability, particularly in large-scale scientific software.
 
 ## DDC Core key features
 
@@ -78,7 +67,7 @@ In a DDC container, `DiscreteElement` indices represent absolute positions, whil
 
 ![Example of two sets of `DiscreteElement`.\label{fig:domains}](domains.pdf)
 
-For example consider \autoref{fig:domains} that illustrates a two-dimensional data chunk with axes `X` and `Y`. Here `chunk_r` is a container defined over the red area and `chunk_b` is a slice of `chunk_r` over the blue area. Let us define
+For example, consider \autoref{fig:domains} that illustrates a two-dimensional data chunk with axes `X` and `Y`. Here `chunk_r` is a container defined over the red area and `chunk_b` is a slice of `chunk_r` over the blue area. Let us define
 
 - `DiscreteElement<X, Y> e(x_c, y_b)`,
 - `DiscreteVector<X, Y> v(2, 1)`,
@@ -95,7 +84,7 @@ This highlights the fact that `DiscreteElement` provides a globally consistent i
 
 ### Sets of `DiscreteElement`
 
-The semantics of DDC containers associates data to a set of `DiscreteElement` indices. Let us note that the set of all possible `DiscreteElement` has a total order that is typically established once and for all at program initialization. Thus to be able to construct a DDC container one must provide a multidimensional set of `DiscreteElement` indices, only these indices can be later used to access the container’s data.
+The semantics of DDC containers associates data to a set of `DiscreteElement` indices. Let us note that the set of all possible `DiscreteElement` has a total order that is typically established once and for all at program initialization. Thus, to be able to construct a DDC container, one must provide a multidimensional set of `DiscreteElement` indices, where only these indices can be later used to access the container’s data.
 
 The library provides several ways to group `DiscreteElement` into sets, each represented as a Cartesian product of per-dimension sets. These sets offer a lookup function to retrieve the position of a multi-index relative to the front of the set. The performance of container data access depends significantly on the compile-time properties of the set used.
 
