Subject:Re: Getting Site Index Status
From:Jay Vasdewani (jvas@gmail.com)
Date:Sep 28, 2010 3:58:26 am
List:com.googlegroups.google-ajax-search-api

The easiest thing to do is set up a Google Alert with the exact content you have on the home page; there is an "As it happens" option. So as soon as it's indexed, you will get the alert.

On Tue, Sep 28, 2010 at 4:17 PM, Jeremy Geerdes <jrge@gmail.com> wrote:

I think that the main difference between info: and site: is that the info: operator will only confirm that a site is in Google's index, whereas the site: operator should tell you what pages have actually been indexed. So info: should give you a link to the site's front page, plus a few other links to get more information, while site: should list all the different pages that Google knows about on the site.

That said, rather than rolling your own application to check whether a site is in Google's index, you might want to consider pointing your customers to Google's Webmaster Tools suite. This service can be tremendously helpful in determining how, and even when, Google is indexing a site by helping webmasters develop sitemaps, analyzing keywords that have brought traffic, and more. They can also call for Googlebot to index their site on demand.

Jeremy R. Geerdes Effective website design & development Des Moines, IA

For more information or a project quote: http://jgeerdes.home.mchsi.com jrge@gmail.com

If you're in the Des Moines, IA, area, check out Debra Heights Wesleyan Church!

On Sep 28, 2010, at 3:58 AM, Vikrant Singh wrote:

Yes I tried that, please see the bottom of my reply...

As you pointed out, it may be because of the unreliable nature of such a search. But when I run the same query with info: instead of site:, I get correct results for the same domain. So it works with info: but does not work with site:

Can you help me understand this behaviour? Or if you can help me differentiate between site: and info: (in this context), that would be really helpful.

On Tue, Sep 28, 2010 at 1:51 PM, Jay Vasdewani <jvas@gmail.com> wrote:

instead of site: try info:domainname

On Tue, Sep 28, 2010 at 1:47 PM, Vikrant Singh <vikk@gmail.com> wrote:

Thanks for the quick reply, Jeremy.

I am not into SEO, but I work with a web hosting company, and we provide a feature that lets customers check whether their site is indexed by popular search engines. I am facing a random issue and need some help with that only. Please point me to the correct forum if this is not the one.

Coming back to the question: when I search for site:domainname.com on the Google search web page, I get the customer's site listed in the results.

But when I run a query like this (just a sample, with the original site name removed) in my browser

http://www.google.com/uds/GwebSearch?context=0&lstkp=0&rsz=large&hl=en&gss=&sig=&q=site:domainname.com&v=1.0

it does not give me correct data.

It returns something like the following...

{"responseData":

{"results":[],"cursor":{"moreResultsUrl":"http://www.google.com/search?oe\u003dutf8\u0026ie\u003dutf8\u0026source\u003duds\u0026start\u003d0\u0026hl\u003den\u0026q\u003dsite:domainname.com"}}, "responseDetails": null, "responseStatus": 200}

Notice that results is [] here, i.e. empty. This is not happening with every site, only with a few customers.

If you replace domainname.com with a valid domain name, you will get valid data. I checked it for facebook.com and I got the attached data.
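For anyone checking this from code rather than by eye, here is a minimal sketch of testing whether "results" came back empty. It assumes only the response shape pasted above; the function name has_results and the None return for a non-200 responseStatus are my own choices for illustration, not part of the API.

```python
import json
from typing import Optional

# Sample response as pasted in the message above (placeholder domain).
SAMPLE_RESPONSE = (
    r'{"responseData": {"results":[],"cursor":{"moreResultsUrl":'
    r'"http://www.google.com/search?oe\u003dutf8\u0026ie\u003dutf8'
    r'\u0026source\u003duds\u0026start\u003d0\u0026hl\u003den'
    r'\u0026q\u003dsite:domainname.com"}}, '
    r'"responseDetails": null, "responseStatus": 200}'
)


def has_results(raw_json: str) -> Optional[bool]:
    """Return True/False for a 200 response, None when the call itself failed.

    has_results is a hypothetical helper name, not part of any Google library.
    """
    payload = json.loads(raw_json)
    if payload.get("responseStatus") != 200:
        # A failed call says nothing about the site, so report "unknown".
        return None
    data = payload.get("responseData") or {}
    return len(data.get("results", [])) > 0


if __name__ == "__main__":
    print(has_results(SAMPLE_RESPONSE))
```

An empty list here only means this particular query returned nothing; it does not by itself show that the site is missing from the index.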

As you pointed out, it may be because of the unreliable nature of such a search. But when I run the same query with info: instead of site:, I get correct results for the same domain. So it works with info: but does not work with site:

Can you help me understand this behaviour? Or if you can help me differentiate between site: and info: (in this context), that would be really helpful.
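To compare the two operators side by side, a small sketch like the following could build both query URLs. The endpoint and parameters are copied from the sample URL in this message; the helper name build_query_url is hypothetical, and the sketch only builds the URLs without sending any request.

```python
from urllib.parse import urlencode

# Endpoint taken from the sample URL quoted in this message.
ENDPOINT = "http://www.google.com/uds/GwebSearch"


def build_query_url(operator: str, domain: str) -> str:
    """Build a query URL for 'site' or 'info' (hypothetical helper name)."""
    params = {
        "context": "0",
        "lstkp": "0",
        "rsz": "large",
        "hl": "en",
        "gss": "",
        "sig": "",
        "q": f"{operator}:{domain}",
        "v": "1.0",
    }
    # safe=":" keeps the operator colon unescaped, as in the sample URL.
    return f"{ENDPOINT}?{urlencode(params, safe=':')}"


if __name__ == "__main__":
    for op in ("site", "info"):
        print(build_query_url(op, "domainname.com"))
```

Keeping everything except the operator identical makes it easier to see whether a difference in the responses comes from site: versus info: rather than from some other parameter.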

On Mon, Sep 27, 2010 at 6:11 PM, Jeremy Geerdes <jrge@gmail.com> wrote:

You can try to send the same site:domainname.com query to the Search API, but the response will not be particularly reliable. To be frank, the API struggles with searches that include operators, particularly site: . Alternatively, you could set up a Custom Search Engine which is restricted to a given site, restrict the Search API to the CSE, and then submit your query. This is generally far more reliable than the other method. However, this is probably more work than just running the query in Google manually.

All of this said, I should note that, generally speaking, this kind of search is done by SEO operators who have a tendency to test a large number of domains automatically. If this is your intention, you must note that the API's TOS prohibit the use of robots, spiders, and other applications which submit large numbers of automated queries.

Jeremy R. Geerdes Effective website design & development Des Moines, IA

For more information or a project quote: http://jgeerdes.home.mchsi.com jrge@gmail.com

If you're in the Des Moines, IA, area, check out Debra Heights Wesleyan Church!

On Sep 27, 2010, at 6:23 AM, Vikrant Singh wrote:

Hello All,

How can I find out whether a particular domain name is indexed by Google? On the Google search home page it is possible by searching for "site:domainname.com" or "info:domainname.com", but I want to know how to do it programmatically.