PDA

View Full Version : Scroogle down for good ?



Renk
02.07.10, 03:05
July 1, 2010: Here we go again...

We regret to announce that our Google scraper may have to be permanently retired, thanks to a change at Google. It depends on whether Google is willing to restore the simple interface that we've been scraping since Scroogle started five years ago. Actually, we've been using that interface for scraping since Google-Watch.org began in 2002.

This interface (here's a sample from years ago) was remarkably stable all that time. During those eight years there were only about five changes that required some programming adjustments. Also, this interface was available at every Google data center in exactly the same form, which allowed us to use 700 IP addresses for Google.

That interface was at Google Toolbar (http://www.google.com/ie) but on May 10, 2010 they took it down and inserted a redirect to /toolbar/ie8/sidebar.html. It used to have a search box, and the results it showed were generic during that entire time. It didn't show the snippets unless you moused-over the links it produced (they were there for our program, so that was okay), and it has never had any ads. Our impression was that these results were from Google's basic algorithms, and that extra features and ads were added on top of these generic results. Three years ago Google launched "Universal Search," which meant that they added results from other Google services on their pages. But this simple interface we were using was not affected at all.

It is not possible to continue Scroogle unless we have a simple interface that is stable. Google's main consumer-oriented interface that they want everyone to use is too complex, too bloated, and changes too frequently, to make our scraping operation possible.

After a lot of suggestions from Scroogle users, and a fair amount of publicity, we found a fix and Scroogle was back in 24 hours. This fix was to insert an extra parameter, &output=ie, into the search terms that were relayed to Google. The extra parameter recovered the same interface that we thought was gone forever.

Now it seems like it actually might be gone forever. Late on June 30, 2010, the results produced while using this parameter began to shift to the usual busy Google interface with ads and a left-margin sidebar. Scroogle users saw a Scroogle page that said, "Google returned no results for this search," when in fact Google returned results but our scraper was unable to deal with them. Over the next few days we will attempt to contact Google and determine whether the old interface is gone as a matter of policy at Google, or if they simply have it hidden somewhere and will tell us where it is so that we can continue to use it.

Thank you for your support during these past five years. Check back in a week or so; if we don't hear from Google by next week, I think we can all assume that Google would rather have no Scroogle, and no privacy for searchers.

— Daniel Brandt, Public Information Research, scroogle AT lavabit.com


https://ssl.scroogle.org/cgi-bin/nbbwssl.cgi

slikrapid
02.07.10, 14:31
related topic:

http://www.sb-innovation.de/showthread.php?threadid=20116

MoS
02.07.10, 14:35
noticed that today and really hope it isn't permanent down..

anon
02.07.10, 15:40
They were able to get through it last time. Hopefully it'll be the same now... I don't want to use Google directly, damn cookies.

shoulder
02.07.10, 19:15
You can block them. :wink2:

anon
02.07.10, 19:16
Disabling cookies for specific sites doesn't survive across restarts in Opera. :dabs:

shoulder
02.07.10, 19:26
No plugin for that? (Firefox vs Opera, 1:0 :P)

anon
02.07.10, 19:27
Plugin? What's that? (2:0)

No, seriously, it's a bug. And a rather serious one for me.

shoulder
02.07.10, 19:29
No plugin support? :eek:

anon
02.07.10, 20:10
You have widgets, but they're far from being the same...

SBfreak
02.07.10, 20:14
Scroogle dead?
Yeyno?
Millions of people use google.
You guys should stick with it too.Stop with the paranoia already..
or move to a better browser:rolleyes2: and do something about it.

anon
02.07.10, 20:15
Scroogle dead?
Yeyno?
Millions of people use google.

Their loss.


or move to a better browser:rolleyes2:

Yeah, I know I should move from Opera 10.10 to Opera 10.60 already. :happy:

SBfreak
02.07.10, 20:17
Does 10.60 have plugins?:tongue:

anon
02.07.10, 20:18
No, but it's a better browser.

Ticko
02.07.10, 20:29
what is scroogle...i have been using google for almost a decade...have i been doing it wrong?? lol

anon
02.07.10, 20:29
what is scroogle...

It's the same thing with a clean layout, no tracking cookies and no IP logs.

SBfreak
02.07.10, 20:31
.have i been doing it wrong?? lol
Yes Google now knows what porn you've been watching.
Criticism of Google - Wikipedia, the free encyclopedia (http://en.wikipedia.org/wiki/Criticism_of_Google#Cookies)

anon
03.07.10, 00:27
IXQuick's a temporal alternative. IP addresses aren't logged, and it doesn't only search for stuff in Google, but multiple engines. If you access via startpage.com, it doesn't use cookies, either. Give it a try:

http://www.startpage.com/

Here's the URL to use for searching directly from your address bar:

http://www.startpage.com/do/metasearch.pl?query=%s

The interface isn't as clean as Scroogle's and there are ads, but you can't have everything in life.

SBfreak
03.07.10, 02:12
Is it based on google??
I mean will I get approximately the same results as google right?

Renk
03.07.10, 03:54
Is it based on google??
I mean will I get approximately the same results as google right?


No, Ixquick is not based on Google.
It uses All the Web, Digg, Qkport, Ask/Teoma, EntireWeb, Wikipedia, Bing, Gigablast, Yahoo, Cuil and Open Directory.

Another Ixquick's advantage is that you can connect to the site you are searching through Ixquick's own proxy.


But as anon said, Ixquick's interface is not so clean than scroogle's.

An other alternative could be Yauba (http://www.yauba.com/). You can search sites, images, videos, blogs, documents (pdf, word, powerpoint...). As with Ixquick, you can access to the site your are searching through yauba's proxy.

But Yauba is s not a metasearch engine, Yauba has no https version, and imo Ixquick's privacy reputation is far better (some people are even thinking that Yauba could be a honeypot).

anon
03.07.10, 17:46
Is it based on google??
I mean will I get approximately the same results as google right?

As Renk said, it searches multiple engines. The number of stars after a result indicates how many of them returned that same result. A search for "SB-Innovation" has us in the second place with five stars.

anon
07.07.10, 15:48
Scroogle is back! :w00t:

SBfreak
07.07.10, 16:08
Not for long:shifty:...:unsure:
After all scrapping google results is against against their terms of service.

anon
07.07.10, 18:37
After all scrapping google results is against against their terms of service.

True. Also, Scroogle has few servers while Google has hundreds - I don't know why they don't simply block the former's IPs if they have a problem with scraping.