Time-stamped Copies of PDB and EMDB Archives

IP
Irina Persikova
Tue, Jan 21, 2025 2:46 PM

A snapshot of the PDB Core archive (https://files.wwpdb.org/,
https://files.wwpdb.org/https://s3.rcsb.org) as of January 1, 2025 has
been added to https://s3snapshots.rcsb.org/, snapshots.rcsb.org (rsync
-rlpy -a -v --delete snapshots.rcsb.org:: .), and
ftp://snapshots.pdbj.org. Snapshots have been archived annually since
2005 to provide readily identifiable data sets for research on the PDB
archive.

The directory 20250101 includes the 229,564 experimentally-determined
structure and experimental data available at that time. Atomic
coordinate and related metadata are available in PDBx/mmCIF, PDB, and
XML file formats. The date and time stamp of each file indicates the
last time the file was modified. The snapshot of PDB Core Archive is
1,437 GB.

A snapshot of the EMDB Core archive
(ftp://ftp.ebi.ac.uk/pub/databases/emdb/) as of January 01, 2025 can be
found in ftp://ftp.ebi.ac.uk/pub/databases/emdb_vault/20250101/ and
ftp://snapshots.pdbj.org/20250101/. The snapshot of EMDB Core Archive
contains map files and their metadata within XML files for both released
and obsoleted entries (41,367 and 301, respectively) and is 21 TB in size.

--
IRINA PERSIKOVA, Ph.D.
Deputy Biocuration Lead, RCSB Protein Data Bank
Research Associate, Institute for Quantitative Biomedicine
Rutgers, The State University of New Jersey
174 Frelinghuysen Road, Piscataway NJ 08854
P: 848.445.4938 | E: irina.persikova@rcsb.org
mailto:irina.persikova@rcsb.org
rcsb.org <www.rcsb.org> | iqb.rutgers.edu http://iqb.rutgers.edu |
facebook https://www.facebook.com/RCSBPDB | twitter
https://twitter.com/buildmodels

A snapshot of the PDB Core archive (https://files.wwpdb.org/, <https://files.wwpdb.org/>https://s3.rcsb.org) as of January 1, 2025 has been added to https://s3snapshots.rcsb.org/, snapshots.rcsb.org (rsync -rlpy -a -v --delete snapshots.rcsb.org:: .), and ftp://snapshots.pdbj.org. Snapshots have been archived annually since 2005 to provide readily identifiable data sets for research on the PDB archive. The directory 20250101 includes the 229,564 experimentally-determined structure and experimental data available at that time. Atomic coordinate and related metadata are available in PDBx/mmCIF, PDB, and XML file formats. The date and time stamp of each file indicates the last time the file was modified. The snapshot of PDB Core Archive is 1,437 GB. A snapshot of the EMDB Core archive (ftp://ftp.ebi.ac.uk/pub/databases/emdb/) as of January 01, 2025 can be found in ftp://ftp.ebi.ac.uk/pub/databases/emdb_vault/20250101/ and ftp://snapshots.pdbj.org/20250101/. The snapshot of EMDB Core Archive contains map files and their metadata within XML files for both released and obsoleted entries (41,367 and 301, respectively) and is 21 TB in size. -- IRINA PERSIKOVA, Ph.D. Deputy Biocuration Lead, RCSB Protein Data Bank Research Associate, Institute for Quantitative Biomedicine Rutgers, The State University of New Jersey 174 Frelinghuysen Road, Piscataway NJ 08854 P: 848.445.4938 | E: irina.persikova@rcsb.org <mailto:irina.persikova@rcsb.org> rcsb.org <www.rcsb.org> | iqb.rutgers.edu <http://iqb.rutgers.edu> | facebook <https://www.facebook.com/RCSBPDB> | twitter <https://twitter.com/buildmodels>