La détection du Link Spam : un challenge pour les moteurs [Bibl.]

Brevets

Brevet de Yahoo (Trustrank)

appft1.uspto.gov/netacgi/nph-Parser

Un brevet de Microsoft (astuce pour rendre le pagerank robuste au Link Spam)

appft1.uspto.gov/netacgi/nph-Parser

Un récent brevet de Google

patft.uspto.gov/netacgi/nph-Parser

BIBLIOGRAPHIE

Link-Based Characterization and Detection of Web Spam
Luca Beccheti, Carlos Castillo, Debora Donato, Stefano Leonardi, Ricardo Baeza-Yates
DIS – Université de Rome « La Sapienza » et Yahoo ! Research

Thwarting the Nigritude Ultramarine : Learning to Identify Link Spam
Isabel Drost and Tobias Scheffer
Université Humboldt – Berlin

Topical Trustrank : Using Topicality to Combat Web Spam
Baowing Wu, Vinay Goel, Brian D. Davison
Université Lehigh, Betlehem USA

Spam, Damn Spam, and Statistics : Using statistical analysis to locate spam web pages
Dennis Fetterly, Mark Manasse, Marc Najork

Microsoft Research

Who links to whom : Mining Linkage between Web Sites
Krishna Bharat, Bay-Wei Chang, Monika Henzinger, Mathias Ruhl
Google, MIT Cambridge USA

Pagerank Increase under Different Collusion Topologies
Ricardo Baeza-Yates, Carlos Castillo, Vicente Lopez
ICREA Uiversité Pompeu Fabra, Université du Chili

Link Spam Alliances
Zoltan Gyöngyi, Hector Garcia-Molina
Université de Stanford

Web Spam Taxonomy
Zoltan Gyöngyi, Hector Garcia-Molina
Université de Stanford

Inside PagerankM. Bianchini, M. Gori, F. Scarselli
ACM Transactions on Internet

Deeper Inside Pagerank
A. Langville, C. Meyer
Internet mathematics 2004

Making Eigenvector-based Reputation Systems robust to collusion
H. Zhang, A. Goel, R. Govindian, K. Mason, B. V. Roy
Third Workshop on Algorithms and Models for the Web Graph 2004

Undue Influence : Elimination the Impact of Link Plagiarism on Web Search Rankings
Baoning Wu, Brian D. Davison
Université Lehigh, Bethlehem USA

Identifying Link Farm Spam Pages

Baoning Wu, Brian D. Davison
Université Lehigh, Bethlehem USA

Page-reRank : using trusted links to re-rank authority
Paolo Massa, Conor Hayes
ITC/iRst

Using Rank Propagation and Probabilistic Counting for Link-based Spam Detection
Luca Beccheti, Carlos Castillo, Debora Donato, Stefano Leonardi, Ricardo Baeza-Yates
DIS – Université de Rome « La Sapienza » et Yahoo ! Research

Link-Based Similarity Search to Fight Web Spam
Andras A. Benczur, Karoly Csalogany, Tamas Sarlos
Académie des Sciences Hongroise et Université Eotvos Budapest

Recognizing Nepotistic Links on the Web

Brian D. Davison
Université Rutgers

Site Level Noise Removal for Search Engines
Andre Luiz da Costa Carvalho, Paul-Alexandru Chirita, Edleno Silva de Moura, Pavel Calado, Wolfgang Nejdl
Université Fédérale de l’Amazone Manaus, L3S et Université de Hanovre, IST/INESC-ID Porto Salvo Portugal

SpamRank – Fully Automatic Link Spam Detection
Andras A. Benczur, Karoly Csalogany, Tamas Sarlos, Mate UherAcadémie des Sciences Hongroise et Université Eotvos Budapest

Link Spam Detection Based on Mass Estimation
Zoltan Gyöngyi, Hector Garcia-Molina, Pavel Berkin, Jan Pedersen
Université de Stanford, Yahoo !

Combating web spam with TrustRank.
Zoltan Gyöngyi, Hector Garcia-Molina, and Jan Pedersen.
In Proceedings of the 30th International Conference on Very Large Data Bases
(VLDB), 2004.
Université de Stanford, Yahoo !

A Cautious Surfer for PageRank
Lan Nie, Baoning Wu, Brian D. Davison

Department of Computer Science & Engineering Lehigh University Bethlehem

Transductive Link Spam DetectionDengyong Zhou, Christopher J.C. Burges,Tao Tao
Microsoft Corp.

Using Spam Farm to Boost PageRank
Ye Du,Yaoyun Shi,Xin Zhao
EECS Department, University of Michigan

Both Sides of the Digital Battle for a High Rank from a Search Engine
Timothy Jones
Department of Computer Science University of Otago (New Zealand)

An Analysis of Factors Used in Search Engine Ranking
Albert Bifet, Carlos Castillo, Paul-Alexandru Chirita, Ingmar Weber
Technical University of Catalonia, University of Chile, L3S Research Center, Max-Planck-Institute for Computer Science

An Analysis of Optimal Link Bombs
Sibel Adalna Liu, Malik Magdon-Ismail
Department of Computer Science, Rensselaer Polytechnic Institute

Integrating the Document Object Model with Hyperlinks for Enhanced Topic Distillation and Information Extraction
Soumen Chakrabarti
Indian Institute of Technology Bombay