Indeed, according to a post on its official blog, Google is experimenting with a way to search the "Deep Web", that is, web pages that can only be reached after filling in a form.
> In the past few months we have been exploring some HTML forms to try to discover new web pages and URLs that we otherwise couldn't find and index for users who search on Google. Specifically, when we encounter a <FORM> element on a high-quality site, we might choose to do a small number of queries using the form. For text boxes, our computers automatically choose words from the site that has the form; for select menus, check boxes, and radio buttons on the form, we choose from among the values of the HTML. Having chosen the values for each input, we generate and then try to crawl URLs that correspond to a possible query a user may have made. If we ascertain that the web page resulting from our query is valid, interesting, and includes content not in our index, we may include it in our index much as we would include any other web page.
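To make the step "we generate and then try to crawl URLs that correspond to a possible query" more concrete, here is a minimal Python sketch. It is not Google's implementation: the function name `candidate_urls`, the `fields` dictionary and the `limit` parameter are all hypothetical, and the sketch assumes a form submitted with GET whose candidate values have already been picked (words from the site for text boxes, listed values for select menus, check boxes and radio buttons).

```python
from itertools import islice, product
from urllib.parse import urlencode, urljoin


def candidate_urls(page_url, action, fields, limit=5):
    """Yield at most `limit` GET URLs for one form.

    `fields` (hypothetical structure) maps each input name to the list
    of values chosen for it. Simplification: `action` is assumed to
    carry no query string of its own.
    """
    names = list(fields)
    # Every combination of one value per input, in a deterministic order.
    combos = product(*(fields[name] for name in names))
    # Resolve the form's action against the page that contains it.
    target = urljoin(page_url, action)
    # Only "a small number of queries": stop after `limit` combinations.
    for combo in islice(combos, limit):
        yield target + "?" + urlencode(list(zip(names, combo)))


if __name__ == "__main__":
    form_fields = {"q": ["books", "maps"], "lang": ["fr", "en"]}
    for url in candidate_urls("http://example.com/catalog/", "search", form_fields, limit=3):
        print(url)
```

Capping the number of combinations mirrors the "small number of queries" in the quote; a real crawler would presumably also decide whether each resulting page is worth indexing, which this sketch leaves out.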
Of course, "the ever-friendly Googlebot", as Google calls it, will still honor the noindex and nofollow directives, which forbid indexing a page or following its links...
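As an illustration of what honoring these directives involves on the crawler side, here is a small Python sketch that reads them from a page's robots meta tag. The names `RobotsMetaParser` and `robots_directives` are hypothetical, and the sketch only covers the meta tag: noindex can also arrive in an X-Robots-Tag HTTP header, and nofollow as a rel attribute on individual links.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collect directives from <meta name="robots"> or <meta name="googlebot">."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        # HTMLParser already lowercases tag and attribute names.
        if tag != "meta":
            return
        attr = dict(attrs)
        if (attr.get("name") or "").lower() in ("robots", "googlebot"):
            content = attr.get("content") or ""
            # The content attribute is a comma-separated list of directives.
            for token in content.split(","):
                token = token.strip().lower()
                if token:
                    self.directives.add(token)


def robots_directives(html):
    """Return the set of robots directives found in `html`."""
    parser = RobotsMetaParser()
    parser.feed(html)
    return parser.directives


if __name__ == "__main__":
    page = '<html><head><meta name="robots" content="noindex, nofollow"></head></html>'
    found = robots_directives(page)
    print("may index:", "noindex" not in found)
    print("may follow links:", "nofollow" not in found)
```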
So when do we get the bot that brute-forces its way past password prompts?!
By Alex
