References

Barbosa, L. and J. Freire. "Siphoning Hidden-Web Data through Keyword-Based Interfaces." SBBD 2004: 309–321.

Bergman, M. K. "The Deep Web: Surfacing Hidden Value." Journal of Electronic Publishing, 2001.

Callan, J. P. and M. E. Connell. "Query-based sampling of text databases." ACM Transactions on Information Systems, 19(2): 97–130, 2001.

Doan, A., P. Domingos, and A. Y. Halevy. "Reconciling Schemas of Disparate Data Sources: A Machine-Learning Approach." SIGMOD Conference 2001: 509–520.

"Forms in HTML documents." http://www.w3.org/TR/html401/interact/forms.html.

He, B., M. Patel, Z. Zhang, and K. C.-C. Chang. "Accessing the Deep Web: A survey." Communications of the ACM, 50(5): 95–101, 2007.

Ipeirotis, P. G. and L. Gravano. "Distributed Search over the Hidden Web: Hierarchical Database Sampling and Selection." VLDB 2002: 394–405.

Madhavan, J., L. Afanasiev, L. Antova, and A.Y. Halevy. "Harnessing the Deep Web: Present and Future." CIDR 2009.

Madhavan, J., S. Jeffery, S. Cohen, X. Dong, D. Ko, C. Yu, and A. Y. Halevy. "Web-scale Data Integration: You can only afford to Pay As You Go." CIDR 2007.

Madhavan, J., D. Ko, L. Kot, V. Ganapathy, A. Rasmussen, and A. Y. Halevy. "Google's Deep-Web Crawl." PVLDB 1(2): 1241–1252 (2008).

Ntoulas, A., P. Zerfos, and J. Cho. "Downloading textual hidden web content through keyword queries." JCDL 2005: 100–109.

Raghavan, S. and H. Garcia-Molina. "Crawling the Hidden Web." VLDB 2001: 129–138.

Salton, G. and M. J. McGill. Introduction to Modern Information ...

Get Beautiful Data now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.