Google provides two services - a search engine and a site cache.
As far as I know it's possible to "opt out" of either.
So, you can have Google ignore you completely.
Or you can have Google list you in its index, make copies of all your content and serve it up to anyone who wants it.
Or you can have Google list you in its index, but not take copies of your content.
Both should be "opt in", in my opinion. If I want Google to list me, I should have to ask them to. If I want them to make a copy of my site, I should have to ask them to.
Put briefly, "Opt out" sucks.
And speaking of briefly, I've had an enough of this thread. No doubt you have too. Thanks for your time, effort and contributions - even if I disagree with the spirit of most of them.
I'll probably build my tool anyway, so I can monitor links to the sites I'm responsible for. I don't have the time to do the actual monitoring myself, and I'd rather not have to pay someone to do it. If the software solution gets me into trouble, I may "cease and desist" and go for a "pecking chicken" solution - fire up the query manually, move the pointer to the "next" link and get some mechanical device to click on the mouse button every so often. Then I can come back and parse my cached pages as I see fit. It's just that the pure software solution seems more elegant.