How to get all the data Google database

169 views
Skip to first unread message

Jerry

unread,
Dec 23, 2011, 3:43:09 AM12/23/11
to Google Safe Browsing API
Today, i install GSB on my PC, but when i update database, i found the
chunknum not began to one, So i wang to konw this why and how can i
update database began one.

Patrick Kelley

unread,
Dec 29, 2011, 2:24:11 PM12/29/11
to google-safe-...@googlegroups.com, Google Safe Browsing API
URLs are often taken off the blacklist when the malware is removed from their server.

When all URLs from a given chunk are clean, then you are no longer required to download that chunk. This is what happened to chunk one. You are no longer required to download chunk one because the information it contained is no longer relevant, or it has been moved into another chunk for performance/efficiency reasons.

Sent from my iPhone

> --
> You received this message because you are subscribed to the Google Groups "Google Safe Browsing API" group.
> To post to this group, send email to google-safe-...@googlegroups.com.
> To unsubscribe from this group, send email to google-safe-browsi...@googlegroups.com.
> For more options, visit this group at http://groups.google.com/group/google-safe-browsing-api?hl=en.
>

Jerry

unread,
Dec 30, 2011, 4:11:14 AM12/30/11
to Google Safe Browsing API
Hi Patrick,

Thank you for you answer, i know your means,but i mean how can i get
all database from google, now i got data is new data.

And other question, when i get chunk according to GSB v2 api, i want
to know there chunk's coding method. like v1 all data is md5 code, but
v2 when i open it, show garbled.

I am primary phper, now i use GSB demo, the source code form "http://
code.google.com/p/phpgsb/".

On Dec 30, 3:24 am, Patrick Kelley <peanutbutterkrac...@gmail.com>
wrote:
> URLs are often taken off the blacklist when the malware is removed from their server.
>
> When all URLs from a given chunk are clean, then you are no longer required to download that chunk.  This is what happened to chunk one.  You are no longer required to download chunk one because the information it contained is no longer relevant, or it has been moved into another chunk for performance/efficiency reasons.
>
> Sent from my iPhone
>

Garrett Casto

unread,
Jan 3, 2012, 3:24:00 AM1/3/12
to google-safe-...@googlegroups.com
The format of the chunks is described at
http://code.google.com/p/google-safe-browsing/wiki/Protocolv2Spec#3.6.1._shavar_list_format.
It's basically a bunch of hashes of "expressions"
(http://code.google.com/p/google-safe-browsing/wiki/Protocolv2Spec#6.2._Simplified_Regular_Expression_Lookup).
It also contains host keys for the expressions, but you can ignore
those.
Reply all
Reply to author
Forward
0 new messages