A flexible learning system for wrapping tables and lists in HTML documents WW Cohen, M Hurst, LS Jensen Proceedings of the 11th international conference on World Wide Web, 232-241, 2002 | 372 | 2002 |
A structured wrapper induction system for extracting information from semi-structured documents W Cohen, L Jensen Proceedings of the Workshop on Adaptive Text Extraction and Mining (IJCAI’01), 2001 | 75 | 2001 |
Extracting person names from diverse and noisy OCR text TL Packer, JF Lutes, AP Stewart, DW Embley, EK Ringger, KD Seppi, ... Proceedings of the fourth workshop on Analytics for noisy unstructured text …, 2010 | 59 | 2010 |
Improving text classification by using conceptual and contextual features LS Jensen, T Martinez Brigham Young University. Department of Computer Science, 2000 | 37 | 2000 |
Grouping extracted fields LS Jensen, W Cohen Proceedings of the IJCAI-2001 Workshop on Adaptive Text Extraction and …, 2001 | 14 | 2001 |
Web Document Analysis: Challenges and Opportunities, chapter A Flexible Learning System for Wrapping Tables and Lists in HTML Documents W Cohen, M Hurst, L Jensen World Scientific, 2003 | 9 | 2003 |
Exploiting sequential relationships for familial classification LS Jensen, JG Shanahan Proceedings of the 19th ACM international conference on Information and …, 2010 | 2 | 2010 |
A Wrapper Induction System for Complex Documents, and its Application to Tabular Data on the Web WW Cohen, M Hurst, LS Jensen Web Document Analysis: Challenges and Opportunities, 155-177, 2003 | 2 | 2003 |