All of Us Research Program Reports Progress on Diverse Genomic Data

April 21, 2023
Dataset provides researchers information from more than 413,450 participants, including whole genome sequences, data from surveys, electronic health records, physical measurements and Fitbit devices

In a progress report, the National Institutes of Health's All of Us Research Program announced that its dataset has grown to include nearly a quarter million whole genome sequences for broad research use.

About 45 percent of the data was donated by people who self-identify with a racial or ethnic group that has been historically underrepresented in medical research. The data expansion provides registered researchers access to the world’s largest and most diverse dataset of its kind, which it says paves the way to help advance health equity and uncover healthcare approaches better tailored to people's genes, lifestyles and environments.

"When All of Us began national enrollment five years ago, we were excited about the promise of how we could advance health research,” said Josh Denny, M.D., M.S., chief executive officer of the All of Us Research Program, in a statement. “Now, through a partnership with participants, researchers, and diverse communities across the country, we are seeing incredible progress towards powering scientific discoveries that can lead to a healthier future for all of us.”

The expanded dataset provides researchers information from more than 413,450 participants. In addition to the whole genome sequences, this resource also includes data from surveys, electronic health records, physical measurements and Fitbit devices. Fitbit device data will now include information on sleep, in addition to activity, step count and heart rate. Sleep data, when used alongside participants’ electronic health record data, could be useful for studying how sleep patterns affect overall health and disease progression, including for conditions such as heart disease, high blood pressure, diabetes, depression and dementia.

The program has also released more than 1,000 detailed long-read sequenced genomes. Long- read sequences are a novel data type that have the potential to provide a more complete picture of the genome.

Through the program’s secure, cloud-based platform for data analysis, the Researcher Workbench, registered researchers can use these data to study genetic variation and potentially identify genetic variants tied to certain conditions. Access to the Researcher Workbench is available to researchers from eligible organizations after they complete training and other requirements.

“For years, the lack of diversity in genomic datasets has limited our understanding of human health,” said Andrea Ramirez, M.D., M.S., chief data officer of the All of Us Research Program, in a statement. “By engaging participants from diverse backgrounds and sharing a more complete picture of their lives – through genomic, lifestyle, clinical and social environmental data – All of Us enables researchers to begin to better pinpoint the drivers of disease.”

The program’s dataset also allows researchers to better define important genetic risks for diseases. The whole-genome sequence dataset alone includes variation at more than 1 billion locations, which is nearly one-third of the entire human genome. These small genetic variations could offer important clues to develop predictive markers for disease risk or explain differences in the effectiveness of certain drugs for different people.

The genomic data are also used to develop personalized health-related DNA results for participants. So far, more than 25,000 participants have requested to receive one or more of these reports detailing whether they have an increased risk for specific health conditions or how their body might process certain medications. These health-related DNA results are part of the program’s commitment to share information and return value to participants, while also making their data broadly available for research use.

More than 5,000 researchers have registered to use the All of Us Researcher Workbench. Registered researchers have access to an expanded set of tools to use in working with available data types. Additionally, featured workspaces are available to provide researchers examples of ways to segment and analyze the data, which can be used to kickstart or enhance their work.

Sponsored Recommendations

10 Reasons to Run Epic on Pure

Gain efficiency & add productivity to your Epic data center. Download now to learn more!

Payer Platform Services and Support

Let’s leverage Payer Platform for smooth, seamless operations.When tasks are important and need to be done right, you trust the experts. The same is true for your...

Pure Powers Progressive Payers

Increase your business agility with Pure’s digital payer platform.Legacy storage solutions cannot keep up with the ever-expanding initiatives in the payer market. To deploy...

Executive Handbook: Ten Transformative Trends 2024

The editors of Healthcare Innovation have published their annual Ten Transformative Trends ensemble of articles