Home > Computers > Internet > Searching > Directories > DMOZ > Research Papers > Web Spam
http://dbpubs.stanford.edu:8090/pub/showDoc.Fulltext?lang=en&doc=2004-52&format=pdf&compression=&name=2004-52.pdf
Zoltan Gyongyi, Hector Garcia-Molina, Stanford University, and Jan Pedersen, Yahoo. Proceedings of the 30th VLDB Conference, 2004. The authors propose techniques which allow to semi-automatically identify reputable pages and then discover more good pages based on the structure of the web. ODP is mentioned because setting up ODP clones is a technique to influence PageRank: to balance this spamming technique, the authors removed all sites which are not listed in the major directories from the data set used for the experiment.
http://www2006.org/programme/files/xhtml/3115/fp3115-wu/fp3115-wu-xhtml.html
Baoning Wu, Vinay Goel and Brian D. Davison propose to partition the seed set used in TrustRank by topic and calculate trust scores for each topic separately, making use of the Open Directory Project. Paper presented to the 15th International World Wide Web Conference, May 2006.
http://airweb.cse.lehigh.edu/2005/gyongyi.pdf
By Zoltán Gyöngyi and Hector Garcia-Molina, Stanford University. First International Workshop on Adversarial Information Retrieval on the Web, May 2005. Offers a definition of spam and an overview on current spamming techniques. The ODP guidelines are quoted as example for existing definitions of spam.
Home > Computers > Internet > Searching > Directories > DMOZ > Research Papers > Web Spam
Thanks to DMOZ, which built a great web directory for nearly two decades and freely shared it with the web. About us