The biggest Problem I have with the Scheduler/GAE isn’t GAE it is Google Bot.
Under the new model you are on the hook for 15 minutes of time for an instance that spins up. Google Bot can’t be throttled on GAE. If you go in to WebMasters Tools you get a “Your site has been assigned a special crawl rate”. This was my favorite feature when I was paying for CPU cycles, but now when Google bot shows up and makes upwards of 100k requests in 5 minutes every 6 hours, Google Bot will be accounting for about 85% of the cost of hosting on several of the sites that have less than $50 a month in hosting costs under the current plan, that will be paying about $400 a month to serve requests to Google come November Pricing.
-Brandon
--
You received this message because you are subscribed to the Google Groups "Google App Engine" group.
To post to this group, send email to google-a...@googlegroups.com.
To unsubscribe from this group, send email to google-appengi...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
503 Service UnavailableThe server is currently unavailable (because it is overloaded or down for maintenance). Generally, this is a temporary state.
Nice, so basically Google is using their own service to tack on
additional charges to your bill. Doesn't sound ethical.
--
You received this message because you are subscribed to the Google Groups "Google App Engine" group.
To post to this group, send email to google-a...@googlegroups.com.
To unsubscribe from this group, send email to google-appengi...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/google-appengine?hl=en.
Returning a 503 is REALLY REALLY Bad for SEO.
304 Seems to be ignored by Google Bot on GAE, but also Google Bot will try queries to which there are no links, and which no user has ever made.
If your latency goes higher Google Bot will throttle back… but the only way to slow down the page serving is to put a wait timer, which burns Instance time… Rob Paul to Pay Peter…
Though I do kind of think that the bandwidth consumed by GOOG on GOOG
infrastructure should be free.
If only...
Your crawl rate changed because the Headers for serving changed from GAE to CF In a Week they will be back to normal.
From: google-a...@googlegroups.com [mailto:google-a...@googlegroups.com] On Behalf Of Roch Delsalle
Sent: Sunday, September 11, 2011 5:02 AM
To: google-a...@googlegroups.com
Subject: [google-appengine] Re: Google Bot Is Your Enemy
Here is what I noticied : http://www.d-ro.ch/2011/09/appengine-cloudflare-crawlrate
Anyway you shouldn't block Googlebot that would de-index you form google.
--
You received this message because you are subscribed to the Google Groups "Google App Engine" group.
To view this discussion on the web visit https://groups.google.com/d/msg/google-appengine/-/ah8AAXrX-kMJ.
Without the Domain name, I don’t. But a trick you can use to get deeper indexing on your pages is to change the server headers, IP address, and Expiration every week or so.
But Google bot responds to Change with “lets crawl everything” when it finds less change, it reduces the crawl.
From: google-a...@googlegroups.com [mailto:google-a...@googlegroups.com] On Behalf Of Roch Delsalle
Sent: Sunday, September 11, 2011 2:17 PM
To: google-a...@googlegroups.com