
Using crowdsourced data, scientists build massive family tree that tells tales of humanity

March 01, 2018

Taking advantage of an online database of public data shared by genealogy enthusiasts, researchers have created a massive, crowdsourced "family tree." By tracing the lives of millions of people, their tree brings to light additional impacts of human culture on the spread of genetic information, suggesting, for example, that a recent reduction in genetic relatedness in Western societies had more to do with shifting cultural factors than with changes in transportation. To date, constructing population-scale family trees has been a labor-intensive process. Here, by leveraging available social media data, Joanna Kaplanis, Yaniv Erlich, and colleagues created such a tree; they analyzed records from 86 million publicly available profiles on a crowdsourced genealogy website.

The data reflect historical events and trends, such as elevated death rates at military age during the American Civil War, WWI, and WWII, and a reduction in child mortality during the 20th century. By comparing these data with traditional genetic studies exploring heritability, the authors found a similar, albeit smaller, estimate of the heritability of longevity. Looking at migration patterns, they found that females migrate more than males in Western societies, but over shorter distances.

Couples born between 1800 and 1850 also showed a two-fold increase in their so-called marital distances, the authors say, from 8 kilometers in 1800 to 19 kilometers in 1850. Intriguingly, the increase in marital distance occurred at the same time as an increase in genetic relatedness (individuals continued to marry relatives), contrary to the theory that people become more genetically diverse as they disperse. Kaplanis et al. write, "From these results, we hypothesize that changes in 19th century transportation were not the primary cause for decreased consanguinity. Rather, our results suggest that shifting cultural factors played a more important role in the recent reduction of genetic relatedness of couples in Western societies."

Notably, to provide additional samples and alleviate concerns that the website's users do not reflect the average person, the researchers obtained every death certificate issued from 1985 to 2000 in the state of Vermont, which has an open policy about death certificates, for a total of nearly 80,000 records. After locating the death certificates of nearly 1,000 profiled individuals who died in Vermont during the same period, the researchers compared key socio-economic attributes between these individuals and the rest of the Vermont database. Importantly, the authors say, each attribute showed nearly perfect concordance between the profiles and the rest of the database.

American Association for the Advancement of Science
