« Response to "Fighting Abuse with Abuse Is A Terrible Idea" | Main | Webhosting Site Goes Live (Finally) »
Now that there is a way for blogspot blog to add Google sitemap, and also a way to ping Google sitemap, does this mean that if a new blog have not been indexed yet, by adding Google sitemap to their blog and then pinging Google sitemap, one can get one's blog indexed faster?
This is part of a response I made to an email on the Yahoo Blogger support group to the above question. There had been a few questions on the subject in recent weeks so I thought I would post it here as post.
The answer or in this case answers is yes and no and now that I think about it, maybe....
If you host your blog on their servers (blogspot), it will take 3-7 days for them to find it.
If you host your blog on your own server, it will take around 5-7 days for them to find it. (using blogger that is.. If say you use Moveable Type, then it will take them forevre to find it if you don't have any inbound links or tell them about it).
If you add a google sitemap and tell them about it, it will still take 3-7 days for them to get round to indexing it.
Also remember that just because you've told google about your site in say hours... It doesn't mean they will do anything with the data for weeks. It all comes down to the quality of the site content (and that goes with physical content as well site construction (coding, colour schemes etc)). Just telling Google about a site is a fraction of the story.
But, here is the word of warning I gave out before...
Yes Google have provided a way for you to authenticate your site by the use of a META Tag...
In order to prove that the sitemap you have submitted to Google belongs to you, they used to get you to upload an 'verification file' to your root directory.
Google would then know that you had access to that folder and the chances were that the site was yours.
Because some Blogspot users can't do this (because you host on blogspot not on your own web server), they provided a method for users to use a META TAG inserted in their template to provide the authentication.
But being on blogspot you cannot generate a SITEMAP.XML (or SITEMAP.XML.GZ - the compressed version).
Usually, this would be a file that contains ALL of the pages on your site. So in the case of one of my personal blogs, a list to over 2500 individual pages.
There is nothing to stop you generating this SITEMAP file no matter who you are hosted with, but if you are on blogspot you cannot upload it to your sites root folder (because you don't have FTP access to that folder).
You cannot place it on a remote machine and then point Google Sitemaps to it because it has to be on the root folder of your website.
But don't despair... You have two options, one of which is a tadge sneaky, but still alllowed.
Now when google scan our site they will find the link to the sitemap, find the sitemap file and hopefully crawl its contents.
We can even place links to sitemaps of different formats. (You can have excel (csv), text (csv), Yahoo flavoured sitemap files, and many others).
One would hope that google would ignore all others and go for the Google flavoured one. So yes you can use Google Sitemaps if you are a blogspot user, but you have to make changes to your ATOM file (feed settings) that may make that file unusable for your readers or simply not contain any content of any real use.
I am yet to work out if is beneficial to do both (1) and (2), but don't know if Google would then not bother to further crawl your site if they already have your ATOM File. (Again, you would hope that your index page would be contained as a link in the ATOM file, which would in turn contain a link to the externally hosted sitemap file, which ulitmately would still get indexed by Google).
I would still like to think that at some point in time, Google would allow Blogspot users to upload, or better still, they would have a tool attached to the Blogspot user interface that allows you to create a sitemap file and then a button to tell them that you have generated one.
When you think that it is google that owns Blogspot, you would hope that they would allow you to host one simple file that would make their own lifes easier.
But then that brings up the point that Blogspot is actually hosted on their own machines anyway, and the simplest way for them to crawl ALL blogspot servers would be as simple as doing "ls -a or dir to all you windows people". Your site is on their machine, you'd think they'd know whats on their own machines. After all, the purpose of google sitemap is to tell them of sites and site content they don't know about !
Posted by Chris Wright at May 15, 2006 11:22 AM
TrackBack URL for this entry:
http://www.yaps4u.net/cgi-bin/mt/mt-tb.cgi/234