Wrong email or password! Try one more time.

Forgot password?

An account with this email already exists.

An email with a confirmation link has been sent to you.

Did you forget your password? Don't panic. Enter your email address,
and we will email you a link where you may create a new password.

If this address exists, we will send you an email with further instructions.

Back to authentication

How can I optimize for “deep web” crawling?

How can I optimize for “deep web” crawling? - answered by Matt Cutts

Matt's answer:

CUTTS: We have a question from Brighton. Danny asks, “What are Google’s plans for indexing the deep web? And are there best practices for form construction to optimize for this?” Great question. We recently published a paper in VLDB, which I believe stands for Very Large Data Bases, that talks exactly about our criteria all the way–so we try to do it safely so that if people don’t want their forms to be crawled, we won’t crawl them. And so there are some very simple things you can do. So, rather than having text that has to be filled out, like a zip code, if you can make it a dropdown, for example, that’s much more helpful. If you can make it so that it is not a huge form with 20 things to fill out, but more like one dropdown or two dropdowns, that’s going to be a lot easier as well. I definitely encourage you to go read the paper. There’s nothing super-duper confidential in it. And, of course, if you can make it so you’re not part of the deep web, if you can take those pages that’s your database and have an HTML site maps so that people can reach all the different pages on your site by crawling through categories or, you know, geographic areas, then we don’t have to fill up forms. And Google is a pretty good company about being able to index the deep web through forms, but not every search engine does that. And so if you can expose that database somewhere where people can get to all the pages on your site just by clicking not by submitting a form, then you’re going to open yourself up to even wider audiences. So, if you can do that, that’s what I’d recommend. But if you can’t do that, than I’d say check out this paper from the VLDB conference where the team talked about it in more detail.

by Matt Cutts - Google's Head of Search Quality Team


Original video: