Smithsonian Open Access Archive

This repository includes approximately 784 terabytes (8.7 million files) of public domain data from the Smithsonian Institution's Open Access collection. Sourced from more than 20 libraries, museums, and research centers across the Smithsonian, this archive is updated weekly.

Product Details

Visibility: Public
Owner: Harvard Library Innovation Lab
Created: 20 Nov 2025
Last Updated: 27 Mar 2026

README

Smithsonian Open Access Archive

This is a mirror of public domain data archived from the Smithsonian Institution's Open Access S3 bucket. The source data can also be searched via the Smithsonian's Collections Search Center by limiting results to CC0 media. We look forward to enhancing the usability and discoverability of this data in the coming months.

This repository is maintained by the Library Innovation Lab at Harvard Law School Library as part of our Public Data Project.

Contents

Navigation
Downloading data
Update frequency
Smithsonian unit codes

Navigation

At present, this repository mirrors the directory structure used by the Smithsonian.

Each root-level directory contains a different type of data: 3d contains 3D models, media contains images, and metadata contains metadata for all objects. For more information on working with a given type of data, please read the corresponding section below.

3D models

3D models are organized solely by identifier. Unlike images and metadata, they are not grouped by Smithsonian unit code. Each subdirectory under 3d is named for an object identifier and may contain several files, including 3D geometry files (GLB, GLTF, OBJ) and other material such as articles/ and resources/ subdirectories.

Also included in each 3d subdirectory is scene.svx.json, an SVX file comprising Smithsonian Voyager scene information as well as general object metadata. This metadata typically includes the object's name, description, and accession number, as well as associated links and identifiers.
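
Because this README does not document the internal layout of scene.svx.json, one cautious way to explore a file is to search the parsed JSON for a key of interest rather than hard-coding a path. The sketch below is illustrative only: find_values and load_scene are hypothetical helper names, and the sample dictionary is invented for the demonstration, not taken from a real scene file.

```python
import json
from typing import Any, Iterator


def find_values(node: Any, key: str) -> Iterator[Any]:
    """Yield every value stored under `key` anywhere in a nested JSON structure."""
    if isinstance(node, dict):
        for name, value in node.items():
            if name == key:
                yield value
            # Keep descending, since the same key may appear at several depths.
            yield from find_values(value, key)
    elif isinstance(node, list):
        for item in node:
            yield from find_values(item, key)


def load_scene(path: str) -> Any:
    """Parse a downloaded scene.svx.json file (the path is supplied by the caller)."""
    with open(path, encoding="utf-8") as handle:
        return json.load(handle)


# Invented miniature document used only to exercise find_values.
# The key names here are assumptions, not a documented schema.
sample = {
    "metas": [{"collection": {"title": "Example object"}}],
    "nodes": [{"name": "model"}],
}
titles = list(find_values(sample, "title"))
```

With a real file, `list(find_values(load_scene("scene.svx.json"), "title"))` would list whatever values the file stores under that key, if any.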

Images

Images are organized by Smithsonian unit code. Each subdirectory under media is named for a Smithsonian unit and contains JPEG and TIFF images for that unit (for example, media/npg/NPG-6500034A_1.jpg).

There is typically, though not always, a high-resolution TIFF for every JPEG and vice versa. Each image file is referenced in an associated metadata record.
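
Since a JPEG and its TIFF appear to share a file name apart from the extension (as in NPG-6500034A_1.jpg and NPG-6500034A_1.tif), a directory listing can be checked for unpaired images by grouping on the name without its extension. This is a minimal sketch that assumes the naming convention holds throughout; pair_images is a hypothetical helper, and the file list is a small sample chosen to exercise the code.

```python
from collections import defaultdict
from pathlib import PurePosixPath

JPEG_SUFFIXES = {".jpg", ".jpeg"}
TIFF_SUFFIXES = {".tif", ".tiff"}


def pair_images(filenames):
    """Return (paired, jpeg_only, tiff_only) lists of names without extensions."""
    formats = defaultdict(set)
    for name in filenames:
        path = PurePosixPath(name)
        suffix = path.suffix.lower()
        if suffix in JPEG_SUFFIXES:
            kind = "jpeg"
        elif suffix in TIFF_SUFFIXES:
            kind = "tiff"
        else:
            continue  # ignore anything that is not an image
        formats[str(path.with_suffix(""))].add(kind)

    paired = sorted(k for k, v in formats.items() if v == {"jpeg", "tiff"})
    jpeg_only = sorted(k for k, v in formats.items() if v == {"jpeg"})
    tiff_only = sorted(k for k, v in formats.items() if v == {"tiff"})
    return paired, jpeg_only, tiff_only


# Sample listing. The last entry is included without a TIFF only to
# demonstrate the unpaired case, not as a claim about the archive.
names = [
    "NPG-6500034A_1.jpg",
    "NPG-6500034A_1.tif",
    "NPG-6500100A_1.jpg",
    "NPG-6500100A_1.tif",
    "NPG-NPG_79_209Truth_d1.jpg",
]
paired, jpeg_only, tiff_only = pair_images(names)
```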

Metadata

Metadata is organized by Smithsonian unit code under metadata/edan and grouped in large, numbered text files (00.txt, 01.txt, and so on) containing line-delimited JSON records.

Also included in each subdirectory is index.txt, an index file listing all the metadata files for that directory.

More than 17 million metadata records, constituting over 47 GB, are included. As a consequence, the metadata files are quite large, and querying them is memory- and time-intensive. If analysis is your goal, we recommend downloading a relevant subset of files and then querying them using a database or an efficient data storage format such as Parquet.
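
As one way to follow this recommendation using only the Python standard library, the sketch below streams a line-delimited JSON file one record at a time and loads it into SQLite, so the whole file never has to fit in memory. The field names id and title are assumptions about the record schema, which this README does not describe; a record lacking them is stored with NULL in that column. The function and table names are hypothetical.

```python
import json
import sqlite3
from typing import Iterable, Iterator, Optional


def iter_records(lines: Iterable[str]) -> Iterator[dict]:
    """Parse line-delimited JSON lazily, skipping blank lines."""
    for line in lines:
        line = line.strip()
        if line:
            yield json.loads(line)


def _text(value) -> Optional[str]:
    """Store any non-null value as text so SQLite accepts it."""
    return None if value is None else str(value)


def load_into_sqlite(lines: Iterable[str], connection: sqlite3.Connection,
                     batch_size: int = 1000) -> int:
    """Insert records in batches and return the number of rows written."""
    connection.execute(
        "CREATE TABLE IF NOT EXISTS records (id TEXT, title TEXT, raw TEXT)"
    )
    insert = "INSERT INTO records VALUES (?, ?, ?)"
    count = 0
    batch = []
    for record in iter_records(lines):
        # "id" and "title" are assumed field names; the full record is kept in raw.
        batch.append((_text(record.get("id")), _text(record.get("title")),
                      json.dumps(record)))
        if len(batch) >= batch_size:
            connection.executemany(insert, batch)
            count += len(batch)
            batch = []
    if batch:
        connection.executemany(insert, batch)
        count += len(batch)
    connection.commit()
    return count


# Invented sample lines standing in for a downloaded metadata file.
sample_lines = [
    '{"id": "example-1", "title": "First"}\n',
    "\n",
    '{"id": "example-2", "title": "Second"}\n',
]
conn = sqlite3.connect(":memory:")
loaded = load_into_sqlite(sample_lines, conn)
```

For a downloaded file, an open file handle can be passed in place of sample_lines, for example `load_into_sqlite(open("02.txt", encoding="utf-8"), sqlite3.connect("nasm.db"))`.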

Downloading data

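To download an individual data object, copy its source URL from the user interface. For example, the URL for one NASM metadata file is https://data.source.coop/harvard-lil/smithsonian-open-access/metadata/edan/nasm/02.txt.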
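
When an object's path within the repository is already known, its source URL can also be assembled in code. This is a hedged sketch that assumes every object follows the same pattern as the example URL https://data.source.coop/harvard-lil/smithsonian-open-access/metadata/edan/nasm/02.txt; source_url and download are hypothetical helper names, and download is defined but not called here.

```python
import shutil
import urllib.request
from urllib.parse import quote

# Base taken from the example source URLs; assumed to apply to all objects.
BASE_URL = "https://data.source.coop/harvard-lil/smithsonian-open-access/"


def source_url(relative_path: str) -> str:
    """Build a source URL from a path relative to the repository root."""
    # quote() leaves "/" intact and percent-encodes characters such as spaces.
    return BASE_URL + quote(relative_path.lstrip("/"))


def download(relative_path: str, destination: str) -> None:
    """Stream one object to a local file without loading it into memory."""
    with urllib.request.urlopen(source_url(relative_path)) as response, \
            open(destination, "wb") as out:
        shutil.copyfileobj(response, out)


metadata_url = source_url("metadata/edan/nasm/02.txt")
```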

To download large numbers of files, we recommend using tools such as the AWS CLI or Rclone to access the S3 endpoint directly at s3://us-west-2.opendata.source.coop/harvard-lil/smithsonian-open-access/.
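
For scripted bulk downloads, one option is to drive the AWS CLI from Python. The sketch below only assembles an `aws s3 sync` command for a chosen prefix. It assumes the AWS CLI is installed and that the bucket path matches the one used in this repository's `aws s3 cp` example. build_sync_command and run_sync are hypothetical helper names, and run_sync is defined but not called.

```python
import subprocess

# Bucket path as it appears in this repository's AWS CLI example.
BUCKET_PREFIX = "s3://us-west-2.opendata.source.coop/harvard-lil/smithsonian-open-access/"


def build_sync_command(prefix: str, destination: str) -> list:
    """Return the argv for an anonymous `aws s3 sync` of one prefix."""
    source = BUCKET_PREFIX + prefix.strip("/") + "/"
    # --no-sign-request lets the CLI read public data without credentials.
    return ["aws", "s3", "sync", source, destination, "--no-sign-request"]


def run_sync(prefix: str, destination: str) -> None:
    """Run the sync and raise CalledProcessError if the CLI reports failure."""
    subprocess.run(build_sync_command(prefix, destination), check=True)


command = build_sync_command("metadata/edan/nasm", "./nasm")
```

Syncing one unit's metadata directory at a time keeps each transfer small relative to the full archive.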

Update frequency

Collection of the files in this repository began in August 2025. The repository is updated weekly to mirror additions to the Smithsonian Institution's Open Access S3 bucket.

Smithsonian unit codes

Here is a list of Smithsonian Institution unit codes used to organize parts of this collection:

Code	Unit
AAA	Archives of American Art
ACM	Anacostia Community Museum
CFCHFOLKLIFE	Ralph Rinzler Folklife Archives and Collections
CHNDM	Cooper Hewitt, Smithsonian Design Museum
EEPA	Eliot Elisofon Photographic Archives
FBR	Smithsonian Field Book Project
FSG	Freer Gallery of Art and Arthur M. Sackler Gallery (National Museum of Asian Art)
HAC	Smithsonian Gardens
HMSG	Hirshhorn Museum and Sculpture Garden
HSFA	Human Studies Film Archives
NAA	National Anthropological Archives
NASM	National Air and Space Museum
NMAAHC	National Museum of African American History and Culture
NMAH	National Museum of American History
NMAI	National Museum of the American Indian

Directory structure (root):

root
├── 3d/
│   ├── 002f2567-7384-4027-9b99-bcb7a7e6361e/
│   ├── 00416a1c-358a-4f5b-9a90-053df0d6fd31/
│   └── …
├── media/
│   ├── aaa/
│   ├── acm/
│   └── …
├── metadata/edan/
│   ├── aaa/
│   ├── aag/
│   └── …
└── README.md

Example 3d subdirectory:

3d/
├── d8c62f94-4ebc-11ea-b77f-2e728ce88125/
│   ├── articles/
│   ├── resources/
│   ├── f1930_54-combined_std.usdz
│   ├── f1930_54-combined-100K-2048_std.glb
│   ├── …
│   └── scene.svx.json
└── …

Example media subdirectory:

media/
├── npg/
│   ├── NPG-6500034A_1.jpg
│   ├── NPG-6500034A_1.tif
│   ├── NPG-6500100A_1.jpg
│   ├── NPG-6500100A_1.tif
│   └── …
└── …

Example metadata subdirectory:

metadata/edan/
├── nasm/
│   ├── 00.txt
│   ├── 01.txt
│   ├── 02.txt
│   ├── …
│   └── index.txt
└── …

Example source URLs:

https://data.source.coop/harvard-lil/smithsonian-open-access/3d/d8c62f94-4ebc-11ea-b77f-2e728ce88125/f1930_54-combined-100K-2048_std.glb
https://data.source.coop/harvard-lil/smithsonian-open-access/media/npg/NPG-NPG_79_209Truth_d1.jpg
https://data.source.coop/harvard-lil/smithsonian-open-access/metadata/edan/nasm/02.txt

Example AWS CLI command, copying one metadata file into the current directory:

aws s3 cp s3://us-west-2.opendata.source.coop/harvard-lil/smithsonian-open-access/metadata/edan/nasm/02.txt . --no-sign-request
