Open Data Collections 

The Art Genome Project (Artsy)

Controlled vocabulary of genre terms used by Artsy to describe and organize their materials. Includes a link to the public API which provides access to Artsy images in the public domain.

The Art Institute of Chicago

The museum provides a REST API with JSON-formatted data for all of their digital collection materials. 50,000 works from their collection are in the public domain or licensed by the museum as CC0, and images are provided of those works.

Biodiversity Heritage Library

The Biodiversity Heritage Library (BHL) provides data exports and APIs to allow individual users and data providers to download, remix and reuse BHL content. This includes an extensive collection of scientific illustrations, sampled here.

Carnegie Museum of Art’s Collection Dataset

Two metadata collections for the museums accessessioned collections and the Teenie Harris Archive. Available as CSV and JSON files, the dataset has a CC0 license, which does not extend to linked images for artworks. 

The Cleveland Museum of Art

Data is made available for over 63,000 works of art and images are made available for 34,000 works. Most data and images are licensed CC0, and the license is indicated in the API.

Getty Vocabularies as Linked Open Data

Harvard Art Museums

Over 235,000 works are available through the website and a JSON API

Library of Congress Prints and Photographs

JSON HTTP API access to Prints and Photographs Online Catalog, which includes approximately 1.2 million images. Neither rights nor licenses are asserted in most cases, with users being responsible for determining if they have the right to reuse content.

Linked Open Data at the Smithsonian American Art Museum

Metadata collections on artworks and artists available under a CC0 license. Metadata collections are accessible via online collections search, URI schema, or SPARQL endpoint.

Metropolitan Museum of Art API

Access to datasets of more than 470,000 works from the Met, provided for unrestricted use and licenses as CC0. The RESTful API provides data in JSON format and links to high resolution JPEG images.

Metropolitan Museum of Art Open Access CSV

Contains datasets of selective collections of artworks in The Met’s collections available as a CSV. Released under a CC0 license. 

Museum of Modern Art (MOMA)

Research dataset for all catalogued artworks within the museum’s collection containing metadata for over 130,000 objects. The artists dataset contains over 15,000 creators. Both datasets are available as CSV and JSON files and have been released under a CC0 license. 

National Gallery of Art (NGA)

You can download the dataset of factual art object information for the 130,000+ artworks and artists in our collection from Github. We release this data under a Creative Commons 0 (CC0) license. This means that you can download the full dataset free of charge, and without seeking our permission.

Nationalmuseum Sweden Wikidata Collection

Wikidata collection of ~4500 paintings with links to high resolution public domain images and IIIF resources. CC0 license. 

Open Source at Cooper Hewitt

A collection of open tools and resources from the museum. Collection data, excluding images, is released under CC0 and available as a downloadable spreadsheet, as individual JSON files, and through the public API. 

The Tate Collection

Inactive dataset last updated in 2014, contains records of around 70,000 objects jointly owned by Tate and the National Gallery of Scotland. JSON and CSV files under a CC0 license. 

Wikidata: The Sum of All Paintings

A project to put every painting in the world into Wikidata. Data is free for access and reuse, notably through a SPARQL query service and an API. Some examples of how the data are already being used can be viewed here.

Yale Center for British Art

Collection of high resolution image files and IIIF resources for the center’s art collection. Public Domain.