Brightsurf Science News and Current Science News Events

 
Email Article | Print

IU informatics researchers throttle notion of search engine dominance

August 08, 2006

BLOOMINGTON, Ind. - Search engines are not biased toward popular Web sites, and may even be egalitarian in the way they direct traffic, say Indiana University School of Informatics researchers.

Their study, "Topical interests and the mitigation of search engine bias," in the Aug. 7-11 issue of the Proceedings of the National Academy of Sciences, challenges the view of a Web-dominating "Googlearchy" in which search engines like Google push all Web traffic to established, mainstream Web sites.




"Empirical data do not support the idea of a vicious cycle amplifying the rich-get-richer dynamic of the Web," said Filippo Menczer, associate professor of informatics and computer science. "Our study demonstrates that popular sites receive on average far less traffic than predicted by the Googlearchy theory and that the playing field is more even."

Menczer was joined in the study by IU post-doctoral fellow Santo Fortunato; Alessandro Flammini, assistant professor of informatics; and Alessandro Vespignani, professor of informatics.

The IU team pooled their expertise in Web mining, networks and complex systems to collect empirical data from various search engines. In one scenario, users browsed the Web using only random links. In another, users visited only pages returned by the search engines. The researchers also studied the way in which search engines have influenced the Web's evolution.

"A simple ranking mechanism provides an elegant model to understand the genesis of a broad class of complex systems, including social and technological networks such as the Internet and the World Wide Web," Fortunato said. "These networks possess a peculiar 'long-tail'TM structure in which a few nodes attract a great majority of connections."

The long tail structure of the Web is commonly explained through rich-get-richer models that require knowledge of the prestige of each node in the network. However, those who create and link Web pages may not know the prestige values of target pages.

In another study, "Scale-Free Network Growth by Ranking," (May 27 Physical Review Letters), the Menczer, Fortunato, and Flammini showed that for a search engine to give rise to a long tail network, it must simply sort nodes according to any prestige measure, even if the exact values are unknown. If new nodes are linked to old ones according to their ranking order, a long tail emerges.

"By sorting results, search engines give us a simple mechanism to interpret how the Web grows and how traffic is distributed among Web sites," said Menczer.

The ranking model can help understand the dynamics of other complex networks besides the Web. For example, in a social system, one may be able to tell which of two people is richer without knowing their bank account balance. Such a criterion might explain the frequency and robustness of the complex structure observed in many real networks.

Indiana University



Related Search Engine News Articles Search Engine News and Current Search Engine Events RSS Search Engine News and Current Search Engine Events RSS
Scientists launch first comprehensive database of human oral microbiome
Scientists know more today than ever before about the microbes that inhabit our mouths. They know so much, in fact, that gathering all of the relevant bits of information into one place when designing experiments can be a job in itself.

Study shows Google favored over other search engines by webmasters
Web site policy makers who use robots.txt files as gatekeepers to specify what is open and what is off limits to Web crawlers have a bias that favors Google over other search engines, say Penn State researchers whose study of more than 7,500 Web sites revealed Google's advantage.

MIT develops lecture search engine to aid students
Imagine you are taking an introductory biology course. You're studying for an exam and realize it would be helpful to revisit the professor's explanation of RNA interference. Fortunately for you, a digital recording of the lecture is online, but the 10-minute explanation you want is buried in a 90-minute lecture you don't have time to watch.

Online game feeds music search engine project at UC San Diego
UC San Diego electrical engineers and computer scientists are working together on a computerized system that will make it easy for people who are not music experts (like the senior author's mom) to find the kind of music they want to listen to - without knowing the names of artists or songs.

New proteomics research promises to revolutionize biomedical discovery
Human cells function through the concerted action of thousands of proteins that control their growth and differentiation. Yet, the specific function of most human proteins remains either unknown or poorly characterized.

Key science Web sites buried in information avalanche
As more and more people are turning to the Internet to find information, important science websites are in danger of becoming buried in the sheer avalanche of facts now available online. Key science sites are failing to register in the top 30 Google search results.

Enterprise management facilities for public authorities
Public authorities have long needed the equivalent of the enterprise management system - as used by leading companies around the world - but seldom had the resources to afford it. Now a new collaborative-working platform developed under the ICTE-PAN project may hold the solution.

Adding semantics to the Web
"The Web will become more than what we see on our computer screens, it will become a place where computers interact with each other and where meaning is attached to information." That is the vision behind a cutting-edge Semantic Web project.

Adding more meaning from place searches
Trawling the web for place-related information is tedious at the best of times. A new search engine, being tested in Europe, recognises geographical terminology and has the intelligence to understand the searches and match them to places.

Firing up knowledge for fire-fighters
Major industrial fires and incidents involving hazardous materials often place fire-fighters in the most dangerous situations of their working lives. Yet as such occurrences are rare, many will face these situations for the first time with little foreknowledge of such incidents. RIMSAT aimed to improve the odds in the fire-fighters' favour.
More Search Engine News Articles
Landing Page Optimization: The Definitive Guide to Testing and Tuning for Conversions
by Tim Ash


Ultimate Guide to Google AdWords (Ultimate Guide to Google Adwords)
by Perry Marshall, Bryan Todd


AdWords For Dummies (For Dummies (Computer/Tech))
by Howie Jacobson


Search Engine Optimization For Dummies, Second Edition (For Dummies (Computer/Tech))
by Peter Kent


SEO: Search Engine Optimization Bible
by Jerri L. Ledford


Search Engine Optimization: An Hour a Day
by Jennifer Grappone, Gradiva Couzin


The AdSense Code: What Google Never Told You About Making Money with AdSense
by Joel Comm


Get to the Top on Google: Tips and Techniques to Get Your Site to the Top of the Search Engine Rankings -- and Stay There
by David Viney


Google Analytics 2.0
by Jerri L. Ledford, Mary E. Tyler


The Google Story: Inside the Hottest Business, Media, and Technology Success of Our Time
by David Vise, Mark Malseed


© 2008 BrightSurf.com