Symlinks in Globus¶
Warning
Globus usually does not follow or copy symbolic links (“symlinks”). DESI data does contain symlinks, so we have documented some suggestions for dealing with these below.
The instructions immediately below apply to DESI DR1 and are intended to cover the case where symlinks need to be re-created after a Globus transfer, or otherwise avoided.
- The top-level
vac/
directory does not contain symlinks so no additional action is required. - The top-level
target/
directory contains several symlinks, and there is a simple workaround for these. These directories are:target/catalogs/
,target/secondary/
,target/masks/
,target/skybricks/v2/
,target/skybricks/v3/
,target/skyhealpixs/v1/
, andtarget/fiberassign/tiles/tags/0.5/
. Globus will follow symlinks in this case if data is downloaded via a transfer “inside” these directories. In other words, point the Globus interface attarget/catalogs/
, for example, and download the contents of the directory. - The
target/catalogs/
directory also contains internal symlinks, but these can be re-created by hand if necessary. See the instructions below. - The
survey/catalogs/
directory, which contains the Large Scale Structure (LSS) catalogs, does not contain symlinks. However, there are several symlinks in directories that contain intermediate data products that would normally be of interest only to experts:survey/GFA/
andsurvey/fiberassign/
. - Finally, the directory
spectro/redux/iron/exposures/
contains many symlinks. This directory contains intermediate data; final spectroscopic outputs and redshift catalogs are stored elsewhere. First, consider whether you need this portion of the data set. If you do want to copy this directory, refer to the instructions below. This also applies to theguadalupe
reduction, i.e.spectro/redux/guadalupe/exposures/
.
Note
There are no symlinks inside spectro/redux/iron/zcatalog/
,
spectro/redux/iron/healpix/
, spectro/redux/iron/tiles/
, and vac/
,
which are the primary locations that users might download in bulk,
so these instructions are only needed for more expert-level files.
Note
For the DESI EDR data set, please see Symlink Instructions for DESI EDR.
Recreating Targeting Symlinks¶
Assuming you have copied the entire target/
directory, the shell commands below will re-create all symlinks in the target/
directory.
cd target/catalogs
(cd gaiadr2/2.2.0; ln -s ../1.1.1/skies-supp skies-supp)
(cd dr9/1.1.1/targets; ln -s ../../1.3.0/targets/main2 main2)
(cd dr9/0.49.0; ln -s ../0.50.0/pixweight pixweight)
(cd dr9/2.2.0; ln -s ../1.1.1/gfas gfas)
(cd dr9/2.2.0; ln -s ../1.1.1/skies skies)
Copying Exposures Symlinks¶
Warning
A NERSC account is required to follow these instructions.
If you need bulk download of the intermediate spectroscopic pipeline
files in spectro/redux/iron/exposures/
or links under survey/
, contact the Data Management
lead listed under the DESI Operations Team Contact List
First, when using this technique, you must “preserve modification times” when performing the Globus transfer.
Assuming you have aready downloaded spectro/redux/iron/exposures/
, cd
to the iron/
directory and run this command:
rsync -rlvtO --dry-run dtn01.nersc.gov:/global/cfs/cdirs/desi/public/dr1/spectro/redux/iron/exposures/ exposures/
--dry-run
option to actually transfer the symlinks.
Links under survey/GFA
and survey/fiberassign
require additional work. Contact the Data Management
lead mentioned above if you need these files and don’t have access to NERSC to directly inspect them.
Note
If you are working with DESI EDR data, replace iron
with fuji
and dr1
with edr
in the rsync command.
Symlink Instructions for DESI EDR¶
- The top-level
survey/
andvac/
directories do not contain symlinks so no additional action is required. Ignore any instructions related to thesurvey/
directory above. - The top-level
target/
directory contains exactly two symlinks:target/catalogs/
andtarget/secondary/
. Globus will follow symlinks in this case if data is downloaded via a transfer “inside” these directories. In other words, point the Globus interface attarget/catalogs/
ortarget/secondary/
and download the contents of those directories. - The
target/catalogs/
directory also contains internal symlinks, but these can be re-created by hand if necessary. See the instructions above. - Finally, the directory
spectro/redux/fuji/exposures/
contains many symlinks. This directory contains intermediate data; final spectroscopic outputs and redshift catalogs are stored elsewhere. First, consider whether you need this portion of the data set. If you do want to copy this directory, refer to the instructions above.
Note
There are no symlinks inside spectro/redux/fuji/zcatalog/
,
spectro/redux/fuji/healpix/
, spectro/redux/fuji/tiles/
, and vac/
,
which are the primary locations that users might download in bulk,
so these instructions are only needed for more expert-level files.