arXiv
Open Access
2012
Design, implementation and experiment of a YeSQL Web Crawler
Pierre Joulin
Romain Deveaud
Eric SanJuan-Ibekwe
Jean-Marc Francony
Françoise Para
Abstrak
We describe a novel, "focusable", scalable, distributed web crawler based on GNU/Linux and PostgreSQL that we designed to be easily extendible and which we have released under a GNU public licence. We also report a first use case related to an analysis of Twitter's streams about the french 2012 presidential elections and the URL's it contains.
Topik & Kata Kunci
Penulis (5)
P
Pierre Joulin
R
Romain Deveaud
E
Eric SanJuan-Ibekwe
J
Jean-Marc Francony
F
Françoise Para
Akses Cepat
Informasi Jurnal
- Tahun Terbit
- 2012
- Bahasa
- en
- Sumber Database
- arXiv
- Akses
- Open Access ✓