Roadrunner: Towards automatic data extraction from large web sites V Crescenzi, G Mecca, P Merialdo VLDB 1, 109-118, 2001 | 1590 | 2001 |
Automatic information extraction from large websites V Crescenzi, G Mecca Journal of the ACM (JACM) 51 (5), 731-779, 2004 | 255 | 2004 |
Grammars have exceptions V Crescenzi, G Mecca Information Systems 23 (8), 539-565, 1998 | 225 | 1998 |
Automatic annotation of data extracted from large Web sites. L Arlotta, V Crescenzi, G Mecca, P Merialdo WebDB, 7-12, 2003 | 168 | 2003 |
Clustering web pages based on their structure V Crescenzi, P Merialdo, P Missier Data & Knowledge Engineering 54 (3), 279-299, 2005 | 126 | 2005 |
RoadRunner: automatic data extraction from data-intensive web sites V Crescenzi, G Mecca, P Merialdo Proceedings of the 2002 ACM SIGMOD international conference on Management of …, 2002 | 105 | 2002 |
Probabilistic models to reconcile complex data from inaccurate data sources L Blanco, V Crescenzi, P Merialdo, P Papotti Advanced Information Systems Engineering: 22nd International Conference …, 2010 | 86 | 2010 |
Extraction and integration of partially overlapping web sources M Bronzi, V Crescenzi, P Merialdo, P Papotti Proceedings of the VLDB Endowment 6 (10), 805-816, 2013 | 85 | 2013 |
The (Short) Araneus Guide to Web-Site Development. G Mecca, P Merialdo, P Atzeni, V Crescenzi, V Crescenzi WebDB (Informal Proceedings), 13-18, 1999 | 56 | 1999 |
The ARANEUS Guide to Web-Site Development. G Mecca, P Merialdo, P Atzeni, V Crescenzi SEBD 1999, 167-177, 1999 | 46 | 1999 |
Automatic Web Information Extraction in the RoadRunner System V Crescenzi, G Mecca, P Merialdo Conceptual Modeling for New Information Systems Technologies: ER 2001 …, 2002 | 44 | 2002 |
Wrapping-oriented classification of web pages V Crescenzi, G Mecca, P Merialdo Proceedings of the 2002 ACM symposium on Applied computing, 1108-1112, 2002 | 40 | 2002 |
Wrapper inference for ambiguous web pages V Crescenzi, P Merialdo Applied Artificial Intelligence 22 (1-2), 21-52, 2008 | 35 | 2008 |
A framework for learning web wrappers from the crowd V Crescenzi, P Merialdo, D Qiu Proceedings of the 22nd international conference on World Wide Web, 261-272, 2013 | 34 | 2013 |
Web content extraction: a metaanalysis of its past and thoughts on its future T Weninger, R Palacios, V Crescenzi, T Gottron, P Merialdo ACM SIGKDD Explorations Newsletter 17 (2), 17-23, 2016 | 29 | 2016 |
Crawling programs for wrapper-based applications C Bertoli, V Crescenzi, P Merialdo 2008 IEEE International Conference on Information Reuse and Integration, 160-165, 2008 | 29 | 2008 |
Crowdsourcing for data management V Crescenzi, AAA Fernandes, P Merialdo, NW Paton Knowledge and Information Systems 53, 1-41, 2017 | 26 | 2017 |
Supporting the automatic construction of entity aware search engines L Blanco, V Crescenzi, P Merialdo, P Papotti Proceedings of the 10th ACM workshop on Web information and data management …, 2008 | 25 | 2008 |
Crowdsourcing large scale wrapper inference V Crescenzi, P Merialdo, D Qiu Distributed and Parallel Databases 33, 95-122, 2015 | 24 | 2015 |
Efficiently Locating Collections of Web Pages to Wrap. L Blanco, V Crescenzi, P Merialdo WEBIST, 247-254, 2005 | 21 | 2005 |