From 66f1d2d9cdc414672d544c66897149d9fe009a86 Mon Sep 17 00:00:00 2001
From: David Minton <daminton@purdue.edu>
Date: Fri, 1 Mar 2024 07:54:09 -0500
Subject: [PATCH] Revised the basic simulation user guide with more detail.

---
 docs/user-guide/basic-simulation/index.rst | 88 ++++++++++++++++++----
 1 file changed, 73 insertions(+), 15 deletions(-)

diff --git a/docs/user-guide/basic-simulation/index.rst b/docs/user-guide/basic-simulation/index.rst
index 0d37f808d..4863764e8 100644
--- a/docs/user-guide/basic-simulation/index.rst
+++ b/docs/user-guide/basic-simulation/index.rst
@@ -3,7 +3,6 @@ Basic Simulation
 #################
 
 Here, we will walk you through the basic features of Swiftest and using them in Python. 
-This is based on ``/Basic_Simulation`` in ``swiftest/examples``.
 
 Start with importing Swiftest. ::
     
@@ -18,31 +17,78 @@ Outputs are stored in the ``./simdata`` directory by default. ::
    sim = swiftest.Simulation()
 
 Now that we have a simulation object set up (with default parameters), we can add bodies to the simulation. 
-The biggest body in the simulation is taken as the central body. Swiftest sets a Simulation object up with a set of default parameters, 
-including a default unit system of AU, y, and solar masses.
+The biggest body in the simulation is taken as the central body. 
 
 Solar System Bodies
 =========================
 
-We can add solar system bodies to the simulation using the :func:`add_solar_system_body <swiftest.Simulation.add_solar_system_body>` method. 
-This method uses JPL Horizons to extract the parameters. ::
+We can add solar system bodies to the simulation using the :func:`add_solar_system_body <swiftest.Simulation.add_solar_system_body>` 
+method.  This method uses JPL Horizons to extract the parameters. ::
    
    # Add the modern planets and the Sun using the JPL Horizons Database.
    sim.add_solar_system_body(["Sun","Mercury","Venus","Earth","Mars","Jupiter","Saturn","Uranus","Neptune"])
 
-Add other small bodies too: ::
-
-   # Add in some main belt asteroids
-   sim.add_solar_system_body(name=["Ceres","Vesta","Pallas","Hygiea"],id_type="smallbody")
-
-   # Add in some big KBOs and Centaurs
-   sim.add_solar_system_body(name=["Pluto","Eris","Haumea","Quaoar", "Chiron","Chariklo"])
 
 Running the Simulation
 ========================
 
-We now can some simulation parameters using the :func:`set_parameter <swiftest.Simulation.set_parameter>` method. 
-Here we have a simulation that runs for 1 My a step size of 0.01 y. We will also save the system every 1000 y and wait until the end of the simulation to write the simulation data to file using the ``dump_cadence=0`` argument ::
+Before we run the simulation, we need to set some parameters to control the total length and step size of an integration. Swiftest 
+sets a Simulation object up with a set of default parameters, including a default unit system of AU, y, and solar masses. However, 
+you are required to set the ``tstop`` and ``dt`` parameters before running the simulation. These control the total time of the 
+simulation and the time step size, respectively.
+
+.. note::
+    The symplectic integrators used in Swiftest are not adaptive, so the time step size is fixed throughout the simulation.
+    Typically you want to choose a step size that is no more than 1/20th of the shortest orbital period in the system. So for the
+    solar system, a step size of 0.01 y is a good choice in order to accurately model Mercury's orbit.
+
+Another important consideration is the number of steps you wish to save and how often the output is saved to file (the output 
+cadence). By default, Swiftest will save every output step to disk. However, Swiftest is designed to simulate systems for long 
+periods of time, so it is often not practical to save every single time step to disk. There are three ways to control how many 
+steps are saved to file: ``tstep_out``, ``istep_out``, and ``nstep_out``.
+
+- ``istep_out``: This is the integer number of time steps between output saves to file, which can be used to control the output 
+  cadence.  For example, if you set ``istep_out=1000``, then the simulation will save the system every 1000 time steps. This is 
+  useful if you want to save the system every N time steps, regardless of the time interval between steps. 
+
+- ``tstep_out``: This is the approximate time interval between output steps. This is the most intuitive way to control the output 
+  cadence. It is the time interval between each output step in the simulation. For example, if you set ``tstep_out=1e3``, then the 
+  simulation will save the system every 1000 y. Internally, Swiftest uses the integer value ``istep_out`` to control the output 
+  cadence, which is computed as::
+
+    istep_out = floor(tstep_out/dt) 
+
+  Only one of either ``tstep_out`` or ``istep_out`` should be set.
+
+- ``nstep_out``: The total number of times that outputs are written to file. Passing this allows for a geometric progression of 
+  output steps, given by the following formula::
+
+        TSTART, f**0 * TSTEP_OUT, f**1 * TSTEP_OUT, f**2 * TSTEP_OUT, ..., f**(nstep_out-1) * TSTEP_OUT
+
+  where ``f`` is a factor that can stretch (or shrink) the time between outputs. Setting::
+
+        nstep_out = int((tstart - tstop) / (tstep_out))
+  
+  is equivalent to the standard linear output (i.e. ``f==1``) and is the same as not passing anything for this argument. 
+
+Simulation data is stored in NetCDF format, which is a self-describing binary file format that is widely used in the scientific
+community. However, writing to disk is a slow process, so writing to disk can be a bottleneck in the simulation. To mitigate this,
+Swiftest has a ``dump_cadence`` parameter that controls how often the simulation data is written to disk. The integer value passed 
+to ``dump_cadence`` controls the number of output steps (as determined ``istep_out``) between when the saved data is dumped to a 
+file. The default value is 10, which means that Swiftest will store 10 outputs in memory before dumping them to file. 
+Setting ``dump_cadence`` to 0 is a a special case that tells Swiftest to store *all* output in memory until the end of the 
+simulation. This is useful for short simulations, but can be memory intensive for long simulations. 
+
+The choice of what values to set for ``tstep_out`` (or ``istep_out``), ``nstep_out``, and ``dump_cadence`` depends on the particular
+simulation. Higher values of ``dump_cadence`` are typically useful for simulations with small numbers of bodies and small values
+of ```tstep_out`` where frequent writing to disk can severely impact performance. For simulations with large numbers of bodies and 
+larger values of ``tstep_out``, it is often better to set ``dump_cadence`` to a smaller value and write the data to disk more often
+so that the memory usage does not become too large. The default value of ``dump_cadence`` of 10 is a good compromise for most use
+caes.
+
+We can set these simulation parameters using the :func:`set_parameter <swiftest.Simulation.set_parameter>` method. 
+Here we have a simulation that runs for 1 My a step size of 0.01 y. We will also save the system every 1000 y and wait until the end
+of the simulation to write the simulation data to file using the ``dump_cadence=0`` argument::
 
     sim.set_parameter(tstop=1.0e6, tstep_out=1e3, dt=0.01, dump_cadence=0)
 
@@ -66,12 +112,24 @@ So the following are all equivalent::
     sim.add_solar_system_body(["Sun","Mercury","Venus","Earth","Mars","Jupiter","Saturn","Uranus","Neptune"])
     sim.run(tstop=1.0e6, tstep_out=1e3, dt=0.01, dump_cadence=0)
 
+.. note::
+    Swiftest uses OpenMP parallelization to help speed up the integration, however the parallelization is most effective when there
+    are large numbers of bodies in the simulation. For small numbers of bodies, the overhead of parallelization can actually slow
+    the simulation down. The number of threads used by Swiftest can be controlled using the ``OMP_NUM_THREADS`` environment
+    variable. For example, to use 4 threads, you can set the environment variable using the following command in a Unix-like shell::
+
+        export OMP_NUM_THREADS=4
+
+    For our example simulation, which only includes the solar system, it is best to run the simulation with a single thread. We plan
+    to build in an adaptive thread control in the future, but for now, you must time your simulations and set the number of threads
+    manually.
 
 Analayzing Simulation Output
 =============================
 
 Once a simulation has been run, its output data is stored in the ``./simdata`` directory. The main data is stored in a file with a 
-default name of ``data.nc``, which is a netCDF file. It is read in and stored as an `Xarray Dataset <https://docs.xarray.dev/en/stable/>`__ object in the ``sim.data`` attribute.
+default name of ``data.nc``, which is a netCDF file. It is read in and stored as an 
+`Xarray Dataset <https://docs.xarray.dev/en/stable/>`__ object in the ``sim.data`` attribute.
 
 
 .. .. toctree::