README for the 4-Dimensional Assimilated Data Set The purpose of the following README file is to assist you to read the data. For scientific content of the data, please refer to the online Guide document. 1. DATA SET INFORMATION The 4-D Assimilated Data Set is a 4-month, global, gridded atmospheric data set for use in large-scale atmospheric research. The Laboratory for Atmospheres Data Assimilation Office (DAO) at Goddard Space Flight Center (GSFC) produced this data set by assimilating rawinsonde reports, satellite retrievals of geopotential thickness, cloud-motion winds, aircraft, ship, and rocketsonde reports with model forecasts employing version 1 of the Goddard Earth Observing System (GEOS-1) atmospheric general circulation model (GCM). At the lower boundary, the assimilating GCM is constrained by the monthly mean observed sea surface temperature and climatological soil moisture derived from monthly mean observed surface air temperature and precipitation fields. The data are well suited for process studies and intraseasonal variability research since they are produced by a fixed assimilation system designed to minimize spinup in the hydrological cycle. By using a nonvarying system the variability due to algorithm change is eliminated and geophysical variability can be more confidently isolated. The data archived at the Goddard Distributed Active Archive Center (DAAC) are a subset of the full data set produced by the DAO. The data are organized chronologically in a time series format to facilitate the computation of statistics spanning long time periods. Data are available for the time period November 1992 through February 1993. Data cover the geographical area 26N-26S and 40E-100W. These data will be online. An expanded subset is available covering the geographical area 50S-50N and 180W-180E. In calculating variables for this data set, the DAO makes a distinction between PROGNOSTIC and DIAGNOSTIC variables. A prognostic parameter is an atmospheric variable that the model forecasts. During the assimilation these are the parameters most directly influenced by the observations. Diagnostic parameters are generally not measured, but are calculated by the model in a manner consistent with the observations. Examples of prognostic parameters include wind, temperature, and humidity. Surface fields include albedo, pressure, wetness, temperature, and ice/water flags. Both upper air and surface prognostic parameters are sampled every 6 hours, though it must be kept in mind that in the current version of the GCM (GEOS-1) some of the surface fields (e.g. albedo, sea surface temperature) are specified from temporally interpolated monthly mean values while the soil moisture is specified using climatology; the fields are saved more frequently simply for convenience. There are also many time averaged diagnostic parameters (e.g., heating rates, precipitation, surface heat fluxes, radiative fluxes at the top and bottom of the atmosphere, and upper air temperature and moisture tendencies) provided either 8 or 4 times daily depending upon whether the parameter is a surface or single level or upper air quantity. 2. DATA SOURCE INFORMATION The assimilated data are a synthesis of observations and short-term model forecasts. Data were collected from globally deployed in situ and remote observations throughout the assimilation period. The platforms used to collect observations are: 1 - Tiros Operational Vertical Sounder (TOVS) (NOAA/NESDIS thickness retrievals) 2 - Ships and Buoys 3 - Surface synoptic reports over land 4 - Rawinsondes, dropwindsondes, rocketsondes 5 - Aircraft (wind measurements) 6 - Cloud-motion winds Sources 1, 4, 5, and 6 are used in the upper air analyses of height and wind, while the moisture analysis uses only rawinsonde reports. Sources 2 and 3 are used in the determination of sea level pressure and near-surface wind analysis over the oceans. It should be noted that for this DAO TOGA COARE analysis all wind and sea level pressure data collected during COARE (that was received over the GTS) were assimilated. TOGA COARE upper air moisture and temperature data were withheld. The analysis scheme is carried out at a horizontal resolution of 2 degrees latitude by 2.5 degrees longitude at 14 upper-air pressure levels (20, 30, 50, 70, 100, 150, 200, 250, 300, 400, 500, 700, 850, 1000 mb) and at sea level. Note, however, that the assimilated data are provided at 18 pressure levels with higher vertical resolution in the lower troposphere (see Section 3). The analysis increments are computed every 6 hours using observations from a +/- 3 hour data window centered on the analysis times (00, 06, 12, and 18 UTC). The model computes a 3-hour forecast from an initial reference time. This forecast is compared to its observational counterpart (which was collected during a 6 hour interval centered on the forecast time). An estimate of the forecast error is computed and divided into incremental updates. The model is then re-run from the initial reference time for 6 hours with the incremental updates applied to the simulation. Then another three-hour forecast is produced and the cycle is repeated. It is the model integration forced by the incremental updates that makes up the assimilated data provided here. For more detailed information on the assimilation system, please refer to the appropriate references in section 7 of this document. 3. DATA SET ORGANIZATION There are 22 different files for each month in the assimilation period, for a total of 88 data files over the 4-month run. Associated with every data file is a short ASCII "table" file containing descriptive information to allow data interpretation by the Grid Analysis and Display System (GrADS) visualization package developed at the University of Maryland (see section 5 for more information on how to obtain the GrADS software). The table file includes the name of the associated data file, the x, y, and z coordinate scales, the value used for undefined data, the number of time periods, the date, and a list of the parameters contained in the data file. Since the data files contain only binary science data, all spatial, temporal and parameter information for these files must be inferred from the file name or the contents of the accompanying table file. The file naming conventions are as follows: Data file : edvl049.prs.xxxxxxx_ss.byymmdd.eyymmdd Table file : edvl049.tabl.xxxxxxx_ss.byymmdd.eyymmdd where byymmdd, eyymmdd are the beginning and ending year, month, and day, respectively (spanning 1 month, e.g., b921101.e921130), and xxxxxxx denotes the parameters contained in that file. These parameters are drawn from the designations given in the third column of the list below. The first part of the file names, edvl049.prs and edvl049.tabl, remain the same for all files. Each file contains parameters for all days of the specified month. The parameters are listed below along with their corresponding units and filename designator: Surface prognostic fields Units File Location ------------------------- ----- ------------- SURFACE GEOPOTENTIAL HEIGHTS (G * M) (m/s)^2 sfcprog SURFACE ALBEDO (0-1) sfcprog SURFACE GROUND WETNESS (0-1) sfcprog SURFACE PRESSURE - PTOP (mb) sfcprog SURFACE GROUND TEMPERATURE (K) sfcprog SEA LEVEL PRESSURE (mb) sfcprog LAND_(2), WATER_(1), ICE_(3) FLAGS - sfcprog VERTICALLY INTEGRATED (BAROTROPIC) U-WIND (m/s) sfcprog VERTICALLY INTEGRATED (BAROTROPIC) V-WIND (m/s) sfcprog Upper air prognostic fields (18 levels) Units File Location --------------------------- ----- ------------- U-WIND (m/s) uwnd V-WIND (m/s) vwnd GEOPOTENTIAL HEIGHT (m) hght TEMPERATURE (K) tmpu SPECIFIC HUMIDITY (g/kg) sphu TURBULENT KINETIC ENERGY (m/s)^2 qq STANDARD DEVIATION OF HEIGHT ERROR (m) hghte Surface diagnostic fields (group 1) Units File Location ----------------------------------- ----- ------------- SURFACE PRESSURE - PTOP (PTOP= 10mb) (mb) diag1 TOTAL PRECIPITATION (mm/day) diag1 CONVECTIVE PRECIPITATION (mm/day) diag1 SURFACE EVAPORATION (mm/day) diag1 VERTICALLY AVERAGED U*Q (m/s g/kg) diag1 VERTICALLY AVERAGED V*Q (m/s g/kg) diag1 VERTICALLY AVERAGED U*T (m/s K) diag1 VERTICALLY AVERAGED V*T (m/s K) diag1 TOTAL PRECIPITABLE WATER (g/cm^2) diag1 Surface diagnostic fields (group 2) Units File Location ----------------------------------- ----- ------------- SURFACE PRESSURE - PTOP (PTOP= 10mb) (mb) diag2 U-MOMENTUM SURFACE STRESS (N/m^2) diag2 V-MOMENTUM SURFACE STRESS (N/m^2) diag2 SURFACE FLUX OF SENSIBLE HEAT (W/m^2) diag2 SURFACE DRAG COEF. FOR T AND Q - diag2 SURFACE DRAG COEF. FOR U AND V - diag2 SURFACE WIND SPEED (m/s) diag2 FRICTION VELOCITY USTAR (m/s) diag2 SURFACE ROUGHNESS Z0 (m) diag2 PBL DEPTH (mb) diag2 Surface diagnostic fields (group 3) Units File Location ----------------------------------- ----- ------------- SURFACE PRESSURE - PTOP (PTOP= 10mb) (mb) diag3 NET UPWARD LW RADIATION AT GROUND (W/m^2) diag3 NET DOWNWARD SW RADIATION AT GROUND (W/m^2) diag3 OUTGOING LONGWAVE RADIATION (W/m^2) diag3 OUTGOING LONGWAVE RADIATION CLEAR SKY (W/m^2) diag3 SURFACE LONGWAVE FLUX CLEAR SKY (W/m^2) diag3 INCIDENT SW RADIATION AT TOP OF ATMOS (W/m^2) diag3 OUTGOING SHORTWAVE RADIATION (W/m^2) diag3 OUTGOING SHORTWAVE RADIATION CLEAR SKY (W/m^2) diag3 SURFACE SHORTWAVE FLUX CLEAR SKY (W/m^2) diag3 2-DIMENSIONAL TOTAL CLOUD FRACTION (0-1) diag3 Surface diagnostic fields (group 4) Units File Location ----------------------------------- ----- ------------- SURFACE PRESSURE - PTOP (PTOP= 10mb) (mb) diag4 GROUND TEMPERATURE (K) diag4 SURFACE AIR TEMPERATURE (K) diag4 SATURATION SURFACE SPECIFIC HUMIDITY (g/kg) diag4 SURFACE PRESSURE TENDENCY (mb/day) diag4 U AT 2 METERS (m/s) diag4 V AT 2 METERS (m/s) diag4 T AT 2 METERS (K) diag4 Q AT 2 METERS (kg/kg) diag4 U AT 10 METERS (m/s) diag4 V AT 10 METERS (m/s) diag4 T AT 10 METERS (K) diag4 Q AT 10 METERS (kg/kg) diag4 Upper air diagnostic fields (18 levels) Units File Location --------------------------------------- ----- ------------- U-MOMENTUM CHANGES DUE TO TURBULENCE (m/s/day) turbu V-MOMENTUM CHANGES DUE TO TURBULENCE (m/s/day) turbv MOISTURE CHANGES DUE TO TURBULENCE (g/kg/day) turbq TEMPERATURE CHANGES DUE TO TURBULENCE (K/day) turbt TEMPERATURE CHANGES DUE TO MOIST PROCESSES (K/day) moistt MOISTURE CHANGES DUE TO MOIST PROCESSES (g/kg/day) moistq TEMPERATURE CHANGES DUE TO LW RADIATION (K/day) radlw TEMPERATURE CHANGES DUE TO SW RADIATION (K/day) radsw VERTICAL VELOCITY (mb/day) omega FILLING OF NEGATIVE SPECIFIC HUMIDITY (g/kg/day) qfill Thus, surface diagnostics are contained in 4 files, surface prognostics in 1 file, upper air prognostics in 7 files, and upper air diagnostics in 10 files. Each upper air prognostic file or each upper air diagnostic file will contain a single parameter for 18 pressure levels. These levels are located at 1000, 950, 900, 850, 800, 700, 600, 500, 400, 300, 250, 200, 150, 100, 70, 50, 30, and 20 mb. On the other hand, each surface prognostic file or each surface diagnostic file will contain all the parameters in that category evaluated at the surface or the vertical average through the atmosphere. A summary of the individual file characteristics is shown in the following table: # Reporting # Reporting File size (bytes) File # Parameters levels times/day (31-day month) ---- ------------ ------------ ------------ -------------- 1. sfcprog 9 1 4 10735920 2. uwnd 1 18 4 21471840 3. vwnd 1 18 4 21471840 4. hght 1 18 4 21471840 5. tmpu 1 18 4 21471840 6. sphu 1 18 4 21471840 7. qq 1 18 4 21471840 8. hghte 1 18 4 16700320 9. diag1 9 1 8 21471840 10. diag2 10 1 8 23857600 11. diag3 11 1 8 26243360 12. diag4 13 1 8 31014880 13. turbu 1 18 4 21471840 14. turbv 1 18 4 21471840 15. turbt 1 18 4 21471840 16. turbq 1 18 4 21471840 17. moistt 1 18 4 21471840 18. moistq 1 18 4 21471840 19. radlw 1 18 4 21471840 20. radsw 1 18 4 21471840 21. omega 1 18 4 21471840 22. qfill 1 18 4 21471840 All prognostic quantities (in files 1-8 above) represent instantaneous values at the designated synoptic time, i.e., at 00Z, 06Z, 12Z or 18Z. The surface diagnostics (in files 9-12), which are reported 8 times daily at 00Z, 03Z, 06Z, etc., are averages over the 3 hour period immediately prior to the designated time tag. For example, a tag of 18Z denotes an average from 15Z-18Z. The upper air diagnostics (in files 13-22) refer to averages over a 6 hour period centered on the designated time tag (i.e., a tag of 18Z denotes an average from 15Z-21Z). All fields are written in a time-series format, i.e., all global parameter arrays are written out for each daily time period in sequence and then for each day of the month. As an example, for surface prognostics for November 1992, the data and table file names will be: edvl049.prs.sfcprog_ss.b921101.e921130 edvl049.tabl.sfcprog_ss.b921101.e921130 with the data file having the following organization: November 1 00Z field 1 ----> field 9 (geopotential height) (vert. int. barotropic vwind) November 1 06Z field 1 ----> field 9 (geopotential height) (vert. int. barotropic vwind) November 1 12Z field 1 ----> field 9 (geopotential height) (vert. int. barotropic vwind) November 1 18Z field 1 ----> field 9 (geopotential height) (vert. int. barotropic vwind) followed by the same geophysical fields for November 2, November 3,..., November 30. In the case of the upper air prognostic variable u-wind, the files would be called: edvl049.prs.uwnd_ss.b921101.e921130 edvl049.tabl.uwnd_ss.b921101.e921130 with the data file having the following organization: November 1 00Z field 1 ----> field 18 (1000mb U-wind) (20 mb U-wind) November 1 06Z field 1 ----> field 18 (1000mb U-wind) (20mb U-wind) November 1 12Z field 1 ----> field 18 (1000mb U-wind) (20mb U-wind) November 1 18Z field 1 ----> field 18 (1000mb U-wind) (20mb U-wind) followed by the same geophysical parameters for November 2, November 3,..., November 30. The logical organization of an upper air diagnostic file will be similar to the upper air prognostic file shown above, while the surface diagnostic files (diag1, diag2, diag3, diag4) will be similar to the surface prognostic files except that there will be twice the number of time periods (3 hour increments) and a different number of fields (9 for diag1, 10 for diag2, 11 for diag3, 13 for diag4 (see above table listings)). For each field in a file, there are 89 grid points in longitude with the first grid point at 40E and with a grid spacing of 2.5 degrees. There are 27 grid points in latitude with the first grid point at 26S and with a grid spacing of 2.0 degrees. The data are stored such that all values along the first latitude (26S) are written first, followed by data from the next northernmost latitude (24S), and continuing on to the last latitude at 26N. For longitude, the first data point is 40E followed by 42.5E and so on to the 89th grid point at 100W. All data are written as IEEE 32-bit floating point numbers. 4. DATA ACCESS INFORMATION The data are IEEE 32-bit floating point, written sequentially. There are no header or trailer records, and the data are distributed uncompressed. A complete description of the data format is given in section 3 above. 4.1 Getting Data To obtain data see Section 6. 4.3 Reading the 4D Assimilation Data Files Two FORTRAN programs are available from the Goddard DAAC to read the upper air data products. These programs are called the UPPER AIR SAMPLE READ PROGRAM and the SINGLE LEVEL SAMPLE READ PROGRAM. They were created and validated on the local Silicon Graphics computers at the Goddard DAAC running Unix operating systems. They may require modification for execution in your local computing environment. These programs are included in Appendix A of this document. A sample loop from these read programs also is given below. PARAMETER (IM=89,JNP=27) REAL FIELD(IM,JNP) DO 1 ITIMES=1,NTIMES DO 2 IXX=1,NXX READ(8) FIELD 2 CONTINUE 1 CONTINUE where IM is the number of longitude grid points JNP is the number of latitude grid points NTIMES is the number of time values data was written NXX is either the number of levels in the file for upper air quantities OR it is the number of variables in the file for single level quantities. For upper air files, the first level is the lowest altitude (1000mb). For each DATA FILE there is an associated TABLE FILE as mentioned in section 3. Some DATA FILES contain a number of different fields at a single level; the TABLE FILE is used to determine how many fields are in the data set. >From the TABLE FILE NTIMES is provided by TDEF (number of time steps) NXX is provided by ZDEF for upper air files (vertical levels) NXX is provided by VARS for single level files (parameters) IM is provided by XDEF (longitude) JNP is provided by YDEF (latitude) The 4-D Assimilated Data Set is compatible with the Grid Analysis and Display System (GrADS) data analysis and visualization software package. This package was developed by Brian Doty with support from Jim Kinter at the Center for Ocean-Land-Atmosphere Studies (COLA) at the University of Maryland. It has gained widespread acceptance by the scientific community as a valuable data analysis tool. The system handles a variety of gridded data sets and observational data. Both the data and table files are required as input in the GrADS package. The software is distributed and supported by the authors via anonymous FTP at COLA (see below for contact information). 5. REFERENCES 1. System documentation and guides "Information on The GSFC Distributed Active Archive Center," August 1994. "The Goddard DAAC Users Guide," October 1994. "EOSDIS Information Management System Users Manual," July 1994. 2. Data set documentation and guides A Guide document written by Daniel Ziskin of the Goddard DAAC containing detailed information on this data set is available via Mosaic at the following URL: http://disc.sci.gsfc.nasa.gov/DATASET_DOCS/toga_dataset.html A reference document is available from the anonymous FTP site hera.gsfc.nasa.gov located in the directory "pub/assimilation/toga". The file name is as follows: toga_doc.ps A detailed description of the Data Assimilation System is located in the directory "pub/tech_memos". The compressed file is named volume_4.ps.Z. A detailed description of the Global Circulation Model is located in the directory "pub/gcm". The compressed file is named geos1.0_gcm.doc.ps.Z. Corrections to this document can be found in the compressed file geos1.0_gcm.errata.ps.Z. 6. POINTS OF CONTACT Archive Information The GSFC DAAC online Information Management System (IMS) provides online search and order capabilities for the 4-D and other data sets. The system is open to the public, but access to certain restricted products requires special authorization. Follow the instructions below to connect to the Goddard DAAC computer via TCP/IP protocol (Internet). 1) Type the following command from your computer connected to the Internet: telnet disc.gsfc.nasa.gov If the message HOST UNKNOWN appears, use the direct TCP/IP address 192.107.190.139. You should receive the message "Connected to disc.gsfc.nasa.gov". 2) After you have connected to the Godard DAAC computer, you will be prompted to log onto the system. Enter the following user name and password: Username: daacims Password: gsfcdaac A series of informational messages will be displayed followed by a user registration screen and the actual data system menu screens. For more information on the Goddard DAAC system capabilities and supported data sets, contact our User Services Office at: Goddard DAAC User Services Office Code 610.2 NASA/Goddard Space Flight Center Greenbelt, MD 20771 phone: (301) 286-3209 fax: (301) 286-1775 email: daacuso@disc.gsfc.nasa.gov Data Producer Information: David Ledvina Data Assimilation Office Code 910.3 NASA/Goddard Space Flight Center Greenbelt, MD 20771 phone: (301) 805-7955 email: ledvina@dao.gsfc.nasa.gov Arthur Hou Data Assimilation Office Code 910.3 NASA/Goddard Space Flight Center Greenbelt, MD 20771 phone: (301) 286-3594 email: hou@dao.gsfc.nasa.gov GrADS software information Brian Doty Center for Ocean-Land-Atmosphere Studies 4041 Powder Mill Road, Suite 302 Calverton, MD 20705 phone: (301) 595-7000 or (301) 902-1248 email: doty@cola.iges.org 7. DATA SET UPDATES There have been no major processing changes in the data set. APPENDIX A: SAMPLE READ PROGRAMS Below is a read program for Surface Prognostic files which can be extracted and modified. C**************************************** C C This program will read the DAO sfcprog data file. It C can be easily modified to read other files which contain C other Single Layer records. C C written by Daniel Ziskin C and David Ledvina C**************************************** C Each record is a 89x27 grid INTEGER ILAT, ILON PARAMETER (ILON=89) PARAMETER (ILAT=27) C There are nine fields in this file REAL*4 PHIS(ILON,ILAT), ALBD(ILON,ILAT), GWET(ILON,ILAT) REAL*4 PS_PTOP(ILON,ILAT), GTMP(ILON,ILAT), SLB(ILON,ILAT) REAL*4 LWI(ILON,ILAT), UBAR(ILON,ILAT), VBAR(ILON,ILAT) C 30 days in November and readings 4 times per day INTEGER NDAYS,NHRS PARAMETER (NDAYS=30) PARAMETER (NHRS=4) INTEGER IDAY, IHR, LUN, I, J, K DATA LUN_IN /21/ DATA LUN_OUT /22/ CHARACTER*80 FILENAME_IN,FILENAME_OUT FILENAME_IN ="edvl049.prs.sfcprog_ss.b921101.e921130" FILENAME_OUT="edvl049.slp_ss.b921101.e921130" C These OPEN statements were written for an SGI computer. C They were not tested on any other platform and may fail. OPEN(UNIT=LUN_IN, STATUS='OLD', + FILE=FILENAME_IN, ACCESS='SEQUENTIAL',FORM='UNFORMATTED') OPEN(UNIT=LUN_OUT, STATUS='OLD', + FILE=FILENAME_OUT, ACCESS='SEQUENTIAL',FORM='UNFORMATTED') C************ THESE LOOPS READ THE WHOLE FILE ********** DO 80 IDAY=1,NDAYS DO 70 IHR=1,NHRS READ(UNIT=LUN_IN)PHIS READ(UNIT=LUN_IN)ALBD READ(UNIT=LUN_IN)GWET READ(UNIT=LUN_IN)PS_PTOP READ(UNIT=LUN_IN)GTMP READ(UNIT=LUN_IN)SLP READ(UNIT=LUN_IN)LWI READ(UNIT=LUN_IN)UBAR READ(UNIT=LUN_IN)VBAR C************This writes a timeseries of a single field to a file WRITE(LUN_OUT) SLP 70 CONTINUE 80 CONTINUE CLOSE(LUN_IN) CLOSE(LUN_OUT) STOP END Below is a read program for Upper Air Files which can be extracted and modified. C**************************************** C C This program will read a DAO upper air data file. It C can be easily modified to read other files which contain C other 18 Pressure Layers. C C written by Daniel Ziskin C and David Ledvina C**************************************** C Each record is a 89x27 grid INTEGER ILAT, ILON PARAMETER (ILON=89) PARAMETER (ILAT=27) C The data at each level will be read into the variable LEVEL REAL*4 LEVEL(ILON,ILAT) C 30 days in November and readings 4 times per day INTEGER NDAYS,NHRS PARAMETER (NDAYS=30) PARAMETER (NHRS=4) C There are 18 pressure levels C The first pressure level is 1000mb INTEGER NPRES,IPRES PARAMETER (NPRES=18) INTEGER IDAY, IHR, LUN, I, J, K DATA LUN_IN /21/ DATA LUN_OUT /22/ CHARACTER*80 FILENAME_IN,FILENAME_OUT FILENAME_IN ="edvl049.prs.uwnd_ss.b921101.e921130" FILENAME_OUT="edvl049.uwnd_1000_ss.b921101.e921130" C These OPEN statements were written for an SGI computer. C They were not tested on any other platform and may fail. OPEN(UNIT=LUN_IN, STATUS='OLD', + FILE=FILENAME_IN, ACCESS='SEQUENTIAL',FORM='UNFORMATTED') OPEN(UNIT=LUN_OUT, STATUS='OLD', + FILE=FILENAME_OUT, ACCESS='SEQUENTIAL',FORM='UNFORMATTED') C************ THESE LOOPS READ THE WHOLE FILE ********** DO 80 IDAY=1,NDAYS DO 70 IHR=1,NHRS DO 60 IPRES=1,NPRES READ(UNIT=LUN)LEVEL 60 CONTINUE C******* THIS WRITES OUT ONLY THE 1000mb LEVEL FOR UWND ********* IF(IPRES .EQ. 1) WRITE(LUN_OUT) LEVEL 70 CONTINUE 80 CONTINUE C NB the first record is closest to the surface and C the altitude increases as IPRES increases CLOSE(LUN_IN) CLOSE(LUN_OUT) STOP END