
Wikipedia readers get shortchanged by copyrighted material

February 13, 2017

UNIVERSITY OF CALIFORNIA, BERKELEY'S HAAS SCHOOL OF BUSINESS--When Google Books digitized 40 years' worth of copyrighted and out-of-copyright issues of Baseball Digest magazine, Wikipedia editors realized they had scored. Suddenly they had access to pages and pages of player information from a new source. Yet not all of that information could be used equally: citations to out-of-copyright issues increased 135 percent more than citations to issues still subject to copyright restrictions.

Those are the results of a new study, "Does Copyright Affect Reuse? Evidence from Google Books and Wikipedia," conditionally accepted in Management Science. By studying how copyright laws restrict the free exchange of information, author Abhishek Nagaraj also found that pages that could benefit from copyrighted information received 20 percent less traffic than pages that could benefit from out-of-copyright information. That presents a significant disadvantage to Wikipedia readers. Copyrighted images were distributed and reused even less, because they cannot be paraphrased and repurposed the way written information can.

Perhaps more importantly, the study's findings suggest how an Internet without copyrighted material may be better used to create new content, not just to let people consume what's already out there.

"There is a big debate about what copyright restrictions do to the diffusion of knowledge. Some people say copyright laws have not caught up with the digital age," says Nagaraj, an assistant professor of management at UC Berkeley's Haas School of Business.

With just about everything available online now, Nagaraj chose to study Baseball Digest for several reasons. First, it is one of only a small number of publications that Google Books digitized in its entirety in 2008. Second, Baseball Digest's copyright status changed over time: the copyright on issues published before 1964 was never renewed, and therefore all pre-1964 issues entered the public domain 28 years after their respective publication dates. Issues published in 1964 and after, by contrast, are not subject to the renewal requirement and remain under copyright, at least until 2020. These conditions gave Nagaraj the ability to study citation variation--under copyright and not under copyright--of the same publication. Third, Nagaraj contends that baseball's popularity would make his experiment "economically meaningful."

Nagaraj created two samples based on the digest's publication years and on 541 players' Wikipedia pages. The players were all nominated for the Baseball Hall of Fame and made their professional debuts between 1944 and 1984. By creating a "quality metric" for each player based on the number of times they played in an all-star game, Nagaraj ensured that each player in the sample had a significant baseball career. The result was a dataset that counts the number of citations to Baseball Digest on each player's Wikipedia page as well as the number of images and word citations.

The data revealed three primary results: 1) There was no variation in the use of information from copyrighted and out-of-copyright sources before the Google Books digitization process; 2) After Baseball Digest was digitized, Wikipedia editors started using both non-copyrighted and copyrighted information, but more so the former; and 3) The effects varied by the type of content. Text material was reused regardless of its copyright status. For example, the factual information that Babe Ruth hit a home run moved from the Digest to Wikipedia smoothly because it could be rewritten. Photos of players and teams, however, were reused more rarely because, unlike text, they could not be reworked into a form that escapes copyright protection.
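The comparison behind the first two results, the same publication before and after digitization, split by copyright status, resembles what is usually called a difference-in-differences calculation. The article does not name the study's method, so that label is an assumption, and the sketch below is only an illustration of the idea. The `GroupCounts` class, the `diff_in_diff` function, and every number in it are hypothetical and are not taken from the study.

```python
from dataclasses import dataclass


@dataclass
class GroupCounts:
    """Average Baseball Digest citations per player page (made-up values)."""
    before: float  # before the 2008 digitization
    after: float   # after the 2008 digitization


def diff_in_diff(public_domain: GroupCounts, copyrighted: GroupCounts) -> float:
    """Growth in citations for public-domain issues minus growth for copyrighted ones."""
    public_change = public_domain.after - public_domain.before
    copyrighted_change = copyrighted.after - copyrighted.before
    return public_change - copyrighted_change


# Hypothetical numbers for illustration only; they are NOT from the study.
# Both groups start at the same level, mirroring result 1 above.
public_domain = GroupCounts(before=1.0, after=5.0)
copyrighted = GroupCounts(before=1.0, after=3.0)

effect = diff_in_diff(public_domain, copyrighted)
print(f"Extra citations per page attributable to copyright status: {effect:.2f}")
```

Because both groups start from the same baseline, any gap that opens after digitization can be read as the effect of copyright status rather than of digitization itself, which helped both groups.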

"Well-known players like Yogi Berra were less affected by this variation because there are enough alternative sources of information besides Baseball Digest," explains Nagaraj. "But there are many players for whom we have limited information. People seeking information about these players are most hurt by copyright law."

This deficiency in the transfer of knowledge impacts not only Internet users who are looking for information but also users seeking to create new content. Nagaraj hopes his work will provide evidence for re-evaluating the value of copyright laws.

"The loss from future copyright extensions is likely to be high. If we want to incentivize new creative work using historical information, we need to fix the system," says Nagaraj.

University of California - Berkeley Haas School of Business
