Federal agencies declare 2023 the Year of Open Science

The Smithsonian Data Science Lab is proud to support the Smithsonian’s effort to celebrate the benefits of open science and expand the adoption of open science practices across the institution.

We are living in an age of “big data.” Insights from big data touch almost every part of our lives, from the way we navigate in our cars to the way we shop. Big data has also arrived in biodiversity research, driven by rapid change in the types and volume of data that researchers can use to ask and answer their scientific questions. The Data Science Lab works with Smithsonian researchers to apply big data techniques, such as deep learning, to generate insights from their data, whether those data come from genome sequencing, ecological sensors, or mass digitization of museum objects. These techniques require computational expertise in hardware and software, both to build new algorithms and to implement emerging tools developed outside the Smithsonian.

Phyllis Diller Gag File

Comedy and AI: Analyzing the Phyllis Diller Gag File through Machine Learning
The American comedian Phyllis Diller donated a collection of 52,000 jokes to the Smithsonian Institution, known as the Gag File. Here, we use Python, machine learning, and…
William J.B. Mattingly
Jun 26, 2023

a bario fish

Machine Learning of Amazonian Fish
As part of his work with the Data Science Lab and SCBI, former graduate student fellow Dr. Alex Robillard publishes a new machine learning model trained to identify…
Alex White
May 1, 2023

Manali sanctuary in India where large numbers of temperate and tropical birds converge to form complex ecological structure

Ecostructure
A vignette for ecostructure, an R package for clustering and visualization of ecological structure across local, regional, phylogenetic, and functional axes of assemblage…
Alex White
Apr 24, 2023

Our research puts data science to work in the museum

Keep up with our research projects and our fruitful collaborations

Explore our research blog

Research Blog

Frances Theresa Densmore, Betty Jane Meggers, and Matilda Stevenson, each woman a member of the Anthropology Department at the National Museum of Natural History.

Uncovering the scientific impact of women at the Smithsonian
Inspired by the immense research output and strong mentoring ethos of the late Vicki Funk, Drs. Dikow and Tsuchiya use machine learning to explore the historical archives of…
Rebecca Dikow and Mirian Tsuchiya
Oct 22, 2020

Thistle specimen and illustration by MV Walcott

What’s in a name?
An investigation of digitized museum records reveals how the historical contributions of women are obscured in the vast digitized Smithsonian collections. Our intern Tiana…
Rebecca Dikow and Megan Glenn
Apr 14, 2020

We want to connect with the Smithsonian community and beyond.

Keep up with Smithsonian Data Science Lab news

Explore our news blog

News

Year of Open Science graphic

The Year of Open Science
The Smithsonian Data Science Lab is proud to support the Smithsonian’s effort to celebrate the benefits of open science and expand the adoption of open science practices…
Alex White
Apr 19, 2023

Software development

Tutorials and vignettes for software developed in the Data Science Lab

Explore software vignettes

Software Vignettes

Manali sanctuary in India where large numbers of temperate and tropical birds converge to form complex ecological structure

Ecostructure
A vignette for ecostructure, an R package for clustering and visualization of ecological structure across local, regional, phylogenetic, and functional axes of assemblage…
Alex White
Apr 24, 2023

Copyright © 2023 Smithsonian
