Extract ECMWF data

Meso-NH can be initialized and forced at its lateral boundary conditions by ECMWF data (operational data, ERAI and/or ERA5). Hereafter you will find the procedure to extract these data.

Operational data (analysis, forecast or ensemble)

Access to ECMWF operational data is restricted to registred users. See ECMWF’s website for more detailed informations.

Configure ECMWF API

First you need to create a file called .ecmwfapirc in your HOME directory. This file contains your key and email given by ECMWF on https://api.ecmwf.int/v1/key/. Your .ecmwfapirc needs to contain lines that look likes:

{
 "url"   : "https://api.ecmwf.int/v1",
 "key"   : "your_key",
 "email" : "your_email"
}

Warning

You need to suppress rules for group and other users with chmod 600 .ecmwfapirc.

Get the tool

To get the tool you can download it with :

git clone https://github.com/JorisPianezze/EXTRACT_ECMWF.git

Note

This Git’s repository is a forked from https://github.com/wurtzj/EXTRACT_ECMWF. It contains additional variables compared to the original.

This tool works as follows :

define_parameters.py is used to read the configuration file called user_parameters.json ;
main_extract_ecmwf.py execute extract_ecmwf.py file in background with nohup.

Install python environment

To install python environment required to extract ECMWF data, if you are using conda, do

conda env create -f environment.yml

Then load new created python environment :

conda activate env_extract_ecmwf

Tip

environment.yml file is located in the Git repository.

Note

Last conda environment tested is

name: env_extract_ecmwf
channels:
  - conda-forge
dependencies:
  - python=3.11.4
  - numpy=1.25.0
  - pandas=2.1.4
  - ecmwf-api-client=1.6.3
  - eccodes=2.33.0

pandas is used to manage dates and eccodes is used to perform some operations on extracted grib files (concatenate and separate by hours)

Define extraction’s parameters

To define your extraction’s area and dates, fill user_parameters.json file. This configuration file contains following variables :

{

"comments"           : "-----------------------------------------------------------",
"comments"           : "  target_directory is where your data will be written      ",
"comments"           : "-----------------------------------------------------------",

"target_directory"   : "/path/to/your/targer/directory/",

"comments"           : "-----------------------------------------------------------",
"comments"           : "  date to be extracted is take between start_date and      ",
"comments"           : "  end_date if list_of_dates is empty ([]).                 ",
"comments"           : "-----------------------------------------------------------",

"start_date"         : "20170708",
"end_date"           : "20170708",
"list_of_dates"      : [],

"comments"           : "-----------------------------------------------------------",
"comments"           : "  start_time, first time to extract within the day         ",
"comments"           : "  end_time : last time extraction within the day           ",
"comments"           : "  step: time step extraction within the day,               ",
"comments"           : "  if end_time=start_time, only start_time will be extracted",
"comments"           : "  forecast_start_time : in case of forecast : time of      ",
"comments"           : "  launch (00 or 12 for instance)                           ",
"comments"           : "-----------------------------------------------------------",

"start_time"         :"00",
"end_time"           :"00",
"step"               :"06",
"forecast_start_time":"00",

"comments"           : "-----------------------------------------------------------",
"comments"           : "  domain extension lat_min, lat_max, lon_min, lon_max or   ",
"comments"           : "  area. If empty get Europe domain.                        ",
"comments"           : "  grid resolution .1 by default. Could be coarser          ",
"comments"           : "-----------------------------------------------------------",

"lat_min"            : "",
"lat_max"            : "",
"lon_min"            : "",
"lon_max"            : "",
"area"               : "",
"grid"               : ".1",

"comments"           : "-----------------------------------------------------------",
"comments"           : "  type_data : type of data to be extracted :               ",
"comments"           : "  analysis, forecast or ensemble.                          ",
"comments"           : "-----------------------------------------------------------",

"type_data"          : "analysis",

"comments"           : "-----------------------------------------------------------",
"comments"           : "  get_surface : get surface parameters or not.             ",
"comments"           : "  Mandatory for Meso-NH users.                             ",
"comments"           : "  get_sea_state : get sea state parameters or not.         ",
"comments"           : "  Used only when sea salt aerosols are activated.          ",
"comments"           : "-----------------------------------------------------------",

"get_surface"        : true,
"get_sea_state"      : false,

"comments"           : "-----------------------------------------------------------",
"comments"           : "  remove_tmp_files : remove temporary files or not.        ",
"comments"           : "  True by default.                                         ",
"comments"           : "-----------------------------------------------------------",

"remove_tmp_files"   : true

}

Launch extraction

Change the target_directory path in the user_parameters.json file and launch extraction of the ECMWF data after loading conda environment :

python main_extract_ecmwf.py

Note

Stored grib files are located in target_directory defined in user_parameters.json file.
At the end of the script extractions are launched in background. If you are using your laptop or personal computer do not switch it off until extraction is complete. If you are using supercomputer you can logout without problems.
You can follow extraction processes on https://apps.ecmwf.int/webmars/joblist/.

Tip

If number of dates is greater than 10, you can do

nohup python main_extract_ecmwf.py

Add new variables

If you want to add variables to extracted grib files, go to define_parameters.py file and add your own grib identifier in the following list :

if type=="FC":
    pressure="/134"
    param_atm="130/131/132/133"
    param_surf="129/172/139/141/170/183/236/39/40/41/42"+ pressure
    param_sea_stae="229/234/237"
else:
    pressure="/152"
    param_atm="130/131/132/133" + pressure
    param_surf="129/172/139/141/170/183/236/39/40/41/42"
    param_sea_state="229/234/237"

Grib identifier mandatory for Meso-NH
Grib Id	Signification	Unit	Type
129	Geopotential	m2/s2	Model level (ml)
172	Land-sea lask	0 - 1	Surface (sfc)
43	Soil type	m2/s2	Surface (sfc)
139	Soil temperature at level 1	K	Surface (sfc)
141	Snow depth	m2/s2	Surface (sfc)
170	Soil temperature at level 2	K	Surface (sfc)
183	Soil temperature at level 3	K	Surface (sfc)
236	Soil temperature at level 4	K	Surface (sfc)
39	Volumetric soil water at layer 1	m3/m3	Surface (sfc)
40	Volumetric soil water at layer 2	m3/m3	Surface (sfc)
41	Volumetric soil water at layer 3	m3/m3	Surface (sfc)
42	Volumetric soil water at layer 4	m3/m3	Surface (sfc)
133	Specific humidity	kg/kg	Model level (ml)
130	Temperature	K	Model level (ml)
131	u-wind	m/s	Model level (ml)
132	v-wind	m/s	Model level (ml)
152 or 134	lnsp (analysis) or sp (forecast)	or Pa	Model level (ml)
229	significant wave height of wind waves and swell	m	Wave (wave)
234	significant wave height of wind waves	m	Wave (wave)
237	significant wave height of swell	m	Wave (wave)

Note

You can find grib identifier here.

Known issues

Error during PREP_REAL_CASE

If you encounter this error during PREP_REAL_CASE :

ECCODES ERROR: grib_accessor_data_ccsds_packing: CCSDS support not enabled. Please rebuild with -DENABLE_AEC=ON (Adaptive Entropy Coding library)
ECCODES ERROR : get: values Functionality not enabled

this means that you are using an ECCODES version < 2.30.0, which is not able to handle CCSDS compressed fields. In this case you need :

either recompile Meso-NH with an ECCODES version > 2.30.0
or add the packing=simple option to the ECMWF API request

Expiration of access token

If you encounter this error during this error during execution of the script :

ecmwfapi.api.APIException: 'ecmwf.API error 1: Access token expired'

You need to regenerate the .ecmwfapirc file with new one https://api.ecmwf.int/v1/key/.

ERA Interim

Go to http://apps.ecmwf.int/datasets/data/interim-full-daily/
Login

Get invariant data

select “Invariant” in “ERA Interim Fields”
select the parameters “Geopotential” and “Land-sea mask”
click on “retrieve GRIB”
- download the grib file and rename it “invariant.grib”. It should contain the variables “z”, “lsm”

Get surface data

select “Daily” in “ERA Interim Fields”
select “Surface” in “Type of level”
select the date YYYY-MM-DD
select the time “HH:00:00”
select the step “0”
select the parameters “Snow depth”, “Soil temperature level 1”, “Soil temperature level 2”, “Soil temperature level 3”, “Soil temperature level 4”, “Volumetric soil water layer 1”, “Volumetric soil water layer 2”, “Volumetric soil water layer 3”, “Volumetric soil water layer 4”
click on “retrieve GRIB”
download the grib file and rename it “surface.grib”. It should contain the variables “sd”, “swvl1”, “swvl2”, “swvl3”, “swvl4”, “stl1”, “stl2”, “stl3”, “stl4”

Get model level data

select “Daily” in “ERA Interim Fields”
select “Model levels” in “Type of level”
select the date YYYY-MM-DD
select the time “HH:00:00”
select the step “0”
select the parameters “Logarithm of surface pressure”, “Specific humidity”, “Temperature”, “U component of wind”, “V component of wind”
click on “retrieve GRIB”
download the grib file and rename it “model.grib”. It should contain the variables “lnsp”, “q”, “t”, “u”, “v”

Concatenate the grib files

$SRC_MESONH/src/dir_obj${XYZ}/MASTER/ECCODES-2.18.0/bin/grib_copy invariant.grib surface.grib model.grib ecmwf.EI.YYYYMMDD.HH

ERA5

Registration

Access to CDS data is restricted to registred users. See CDS’s website for more detailed informations.

Configure

Once your account create, you need to configure CDS API following https://cds.climate.copernicus.eu/api-how-to instructions. First you need to create a file called .cdsapirc in your HOME directory. Your .cdsapirc file needs to contain lines that look likes:

url: https://cds.climate.copernicus.eu/api/v2
key: your_key

Note

To fill this file go to https://cds.climate.copernicus.eu/api-how-to.

Warning

You need to suppress rules for group and other users with chmod 600 .cdsapirc.

Installation

You can install CDS API required to extract CAMS data using following conda command in your conda environment, do

conda install cdsapi

If you want to used a dedicated conda environment you can create an environment.yml file containing :

name: env_extract_cdsapi
channels:
    - conda-forge
dependencies:
    - cdsapi==0.7.0
    - eccodes==2.35.0
    - pip==24.0
    - pip:
        - yaml-config==0.1.5

Note

This is the last version of cdsapi and yaml-config tested.
yaml-config is used to read .adsapirc file if your are intent to extract CAMS files. For ERA5, only cdsapi is necessary.
eccodes is used to concatenate grib files.

Then you can create your conda environment with :

conda env create -f environment.yml

Then load new created python environment :

conda activate env_extract_cdsapi

Example

To extract ERA5 data, you can adapt the area and the date in the following script :

#!/bin/python
# --------------------------------------------------------
#
#                 Author  (    date    ) :
#             J. Pianezze ( 17.05.2024 )
#
#                    ~~~~~~~~~~~~~~~
#       Script used to extract ERA5 instantaneous fields
#        for Meso-NH (PREP_REAL_CASE) (1 time / file)
#                    ~~~~~~~~~~~~~~~
#
# --------------------------------------------------------
# https://cds.climate.copernicus.eu/api-how-to
# conda install cdsapi

import os, sys
import glob
import cdsapi
import datetime

cds = cdsapi.Client()

# #########################################################
# ###           To be defined by user                   ###
# #########################################################

# - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
# - -     First and last date to be extracted           - -
# - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
first_date_to_be_extracted = datetime.datetime(2025, 11, 13, 18, 0, 0)
last_date_to_be_extracted  = datetime.datetime(2025, 11, 13, 19, 0, 0)

# - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
# - -          Type of data to be extracted             - -
# - -         analyses (an) or forecast (fc)            - -
# - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
type_data_to_be_extracted = 'an'

# - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
# - -     period_in_hr between two forcing files        - -
# - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
period_between_last_and_first_dates_in_hr = 1

# - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
# - -     Area to be extracted : 'North/West/South/East'- -
# - -     Benguela    : '-20.0/5.0/-40.0/25.0'          - -
# - -     Gulf Stream : '50.0/-90.0/20.0/-30.0'         - -
# - -     Tutorial    : '50.0/-10.0/35.0/10.0'
# - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
area_to_be_extracted = '50.0/-10.0/35.0/10.0'

# #########################################################

# ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
#   Define function to iterate over first and last dates
# ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
def range_for_date(start_date, end_date, period_in_hr):
  for n in range(int((end_date - start_date).total_seconds()/(3600.0*period_in_hr))+1):
    yield start_date + datetime.timedelta(seconds=n*3600.0*period_in_hr)

# ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
#   Loop over dates
# ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
for date in range_for_date(first_date_to_be_extracted, last_date_to_be_extracted, period_between_last_and_first_dates_in_hr):

  # ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  #   Compute date and time variables
  #     date_an format is yyyy-mm-dd
  #     time_an format is hh
  # ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  date_to_be_extracted   = str(date.year)+'-'+str(date.month).zfill(2)+'-'+str(date.day).zfill(2)
  time_to_be_extracted   = str(date.hour).zfill(2)
  name_of_extracted_file = str(date.year)+str(date.month).zfill(2)+str(date.day).zfill(2)+'.'+str(date.hour).zfill(2)

  # ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  #   Retrieve Model Level fields : u, v, t, q
  # ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  cds.retrieve('reanalysis-era5-complete', {
        'date'     : date_to_be_extracted,
        'levelist' : '1/to/137',
        'levtype'  : 'ml',
        'param'    : 'u/v/t/q',
        'stream'   : 'oper',
        'time'     : time_to_be_extracted,
        'type'     : type_data_to_be_extracted,
        'area'     : area_to_be_extracted,
        'grid'     : '0.28125/0.28125',
  }, 'model_levels_uvtq_'+name_of_extracted_file+'.grib')

  # ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  #   Retrieve Model Level fields : lnsp
  # ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  cds.retrieve('reanalysis-era5-complete', {
        'date'     : date_to_be_extracted,
        'levelist' : '1',
        'levtype'  : 'ml',
        'param'    : 'lnsp',
        'stream'   : 'oper',
        'time'     : time_to_be_extracted,
        'type'     : type_data_to_be_extracted,
        'area'     : area_to_be_extracted,
        'grid'     : '0.28125/0.28125',
  }, 'model_levels_lnsp_'+name_of_extracted_file+'.grib')

  # ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  #   Retrieve SurFaCe fields : z, lsm
  # ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  cds.retrieve('reanalysis-era5-complete', {
        'date'     : date_to_be_extracted,
        'levtype'  : 'sfc',
        'param'    : 'z/lsm',
        'stream'   : 'oper',
        'time'     : time_to_be_extracted,
        'type'     : type_data_to_be_extracted,
        'area'     : area_to_be_extracted,
        'grid'     : '0.28125/0.28125',
  },  'surface_levels_'+name_of_extracted_file+'.grib')

  # ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  #   Retrieve SurFaCe fields : swlv1, ...
  # ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  cds.retrieve('reanalysis-era5-complete', {
      'date'     : date_to_be_extracted,
      'levtype'  : 'sfc',
      'param'    : '139/141/170/183/236/39/40/41/42',
      'stream'   : 'oper',
      'time'     : time_to_be_extracted,
      'type'     : type_data_to_be_extracted,
      'area'     : area_to_be_extracted,
      'grid'     : '0.28125/0.28125',
  }, 'surface_'+name_of_extracted_file+'.grib')

  # ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  #   Concatenate & remove grib files
  # ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  os.system('grib_copy surface_levels_'+name_of_extracted_file+'.grib    '+\
                      'surface_'+name_of_extracted_file+'.grib           '+\
                      'model_levels_uvtq_'+name_of_extracted_file+'.grib '+\
                      'model_levels_lnsp_'+name_of_extracted_file+'.grib '+\
                      'era5.'+name_of_extracted_file)

  for file in glob.glob('*.grib'):
     os.remove(file)

Note

You need to have eccodes installed.

Then, you can launch the extraction with :

./your_script.sh

Note

At the end of the extraction you need to have files called era5.yearmonthday.hour !

Use extracted GRIB files

To use extracted grib files in Meso-NH, you just have to add the name of the grib file in PRE_REAL1.nam file, by example :

&NAM_FILE_NAMES HATMFILE='name/of/your/grib/file',
                HATMFILETYPE='GRIBEX',
                ... /