I'm running e2guardian development branch with man-in-the-middle explicit proxy enabled on Debian 8. I used the configure options from FredBcode's sticky post titled "E2guardian Debian Package." I've also uncommented the new textmimetypes option in hopes it would filter youtube searches (I inquired about this in an earlier post, but received no replies).
With textmimetypes on, e2guardian still does not filter Youtube searches.
In all my tests, I search for "nude women."
In Firefox, when I browse to google.com and search from the site, the search is properly filtered. However, searches from within the Firefox's search bar are not filtered (the search provider is Google).
The opposite is true in Internet Explorer 11. Searches from google.com are not filtered, but searches in the search bar are correctly filtered (again Google is the provider).
Searches with Yahoo from Firefox are not filtered with either method.
Bing.com offers the only consistently acceptable behavior; both types of searches are filtered correctly, including those from https://bing.com.
EDIT: The search in Firefox for Windows from the google.com site doesn't work after all; the e2guardian block screen appears as expected. However, when I press the back button, the search results appear as if never filtered. Arrgh!
So... I'm frustrated. This is an awkward question to ask here, but would a paid service such as SafeSquid provide better results? Or am I doomed to run in circles with any MTIM filter?
I would appreciate some encouraging news. Do others observe the same behavior? Can I tweak my configuration somehow?
Can someone explain why it doesn't work as expected?
This is the access.log entry from the Firefox search bar attempt:
2015.12.3 15:52:14 - 192.168.1.4 https://www.google.com/search?q=nude+women&ie=utf-8&oe=utf-8 GET 139340 0 1 200 text/html
--
You received this message because you are subscribed to the Google Groups "e2guardian" group.
To unsubscribe from this group and stop receiving emails from it, send an email to e2guardian+...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.
Sorry to keep pestering you...
I replaced the existing searchregexplist with the version you sent and uncommented the corresponding line in e2guardianf1.conf.
I chose "method 2" so I set the searchtermlimit to 50.
Now a search for a non-naughty phrase such as "star wars trailer" causes a "connection was reset" page to appear on any of the search providers listed in the searchregexplist.
This occurs even on unsecure http://bing.com.
Did I miss something? I don't suppose Protex has an unadvertised "home user" version. My needs are pretty simple; just filter searches on the major sites and Youtube.
Thanks.
Jason
From: jho...@bruinmail.slcc.edu
To: "e2guardian" <e2gua...@googlegroups.com>
Cc: "philip pearce" <philip...@e2bn.org>
Sent: Wednesday, 9 December, 2015 8:56:10 PM
Subject: Re: I feel a bit discouraged
FredB,
I'm sorry to be so ignorant. I'm not sure where to look for error logs. This is the entry from /var/log/messages but I doubt it is what you need:
Dec 9 15:47:10 zombie kernel: [ 242.536151] e2guardian[1000]: segfault at 5b570a8 ip 00000000004915ed sp 00007ffe1d0d89b8 error 4 in e2guardian[400000+ed000]
----
I built the most recent development branch downloaded a few minutes ago. I only made two changes to the default configuration.
1. Uncommentted searchregexplist = '/etc/e2guardian/lists/searchregexplist'
2. Uncommented and set searchtermlimit = 50
I did not modify any other files. I am using Debian 8.2 amd64
This error occurred when I searched for "star wars" on http.bing.com (not https). When searching for "nude women" from the same page, e2guardian blocked the search as expected.
Thanks
Jason
I only chose bing.com because it will allow http searches and I wanted to rule out any errors associated with MITM. When I enable MITM, the same error occurs with google searches.
If I comment out the bing.com entries in searchregexplist, no error occurs.
Thanks for your efforts.
----
2015.12.10 8:57:32 - 192.168.1.4 http://www.bing.com/AS/Suggestions?pt=page.home&mkt=en-us&qry=s&cp=1&o=hs&css=1&cvid=97A3A0AE190B4F8AB22A5969ACB7CBF6 GET 49440 0 1 200 text/html - -
2015.12.10 8:57:32 - 192.168.1.4 http://www.bing.com/AS/Suggestions?pt=page.home&mkt=en-us&qry=st&cp=2&o=hs&cvid=97A3A0AE190B4F8AB22A5969ACB7CBF6 GET 3085 0 1 200 text/html - -
2015.12.10 8:57:33 - 192.168.1.4 http://www.bing.com/AS/Suggestions?pt=page.home&mkt=en-us&qry=sta&cp=3&o=hs&cvid=97A3A0AE190B4F8AB22A5969ACB7CBF6 GET 3156 0 1 200 text/html - -
2015.12.10 8:57:33 - 192.168.1.4 http://www.bing.com/AS/Suggestions?pt=page.home&mkt=en-us&qry=star&cp=4&o=hs&cvid=97A3A0AE190B4F8AB22A5969ACB7CBF6 GET 3164 0 1 200 text/html - -
2015.12.10 8:57:34 - 192.168.1.4 http://www.bing.com/AS/Suggestions?pt=page.home&mkt=en-us&qry=star%20&cp=5&o=hs&cvid=97A3A0AE190B4F8AB22A5969ACB7CBF6 GET 3259 0 1 200 text/html - -
2015.12.10 8:57:34 - 192.168.1.4 http://www.bing.com/AS/Suggestions?pt=page.home&mkt=en-us&qry=star%20w&cp=6&o=hs&cvid=97A3A0AE190B4F8AB22A5969ACB7CBF6 GET 3333 0 1 200 text/html - -
2015.12.10 8:57:35 - 192.168.1.4 http://www.bing.com/AS/Suggestions?pt=page.home&mkt=en-us&qry=star%20wa&cp=7&o=hs&cvid=97A3A0AE190B4F8AB22A5969ACB7CBF6 GET 3359 0 1 200 text/html - -
2015.12.10 8:57:35 - 192.168.1.4 http://www.bing.com/AS/Suggestions?pt=page.home&mkt=en-us&qry=star%20war&cp=8&o=hs&cvid=97A3A0AE190B4F8AB22A5969ACB7CBF6 GET 3370 0 1 200 text/html - -
2015.12.10 8:57:36 - 192.168.1.4 http://www.bing.com/AS/Suggestions?pt=page.home&mkt=en-us&qry=star%20wars&cp=9&o=hs&cvid=97A3A0AE190B4F8AB22A5969ACB7CBF6 GET 3361 0 1 200 text/html - -
It is from /var/log/e2guardian/access.log
I will send you my files, but I'll remind you that I did a fresh build and haven't modified anything other than the two lines in the e2guardianf1.conf file.
Thanks
From: jho...@bruinmail.slcc.edu
To: "e2guardian" <e2gua...@googlegroups.com>
Sent: Friday, 11 December, 2015 3:21:47 PM
Subject: Re: I feel a bit discouraged
Yes I can confirm the page reset corresponds with the seg fault.
I will send you my files, but I'll remind you that I did a fresh build and haven't modified anything other than the two lines in the e2guardianf1.conf file.
Thanks
Method 2 no longer causes a page reset! But... it doesn't filter the search.
Method 1 works as expected. I entered the word "hello" in the bannedsearchlist and any search on google.com or youtube.com is blocked for the reason: Banned Search Words: hello. Category is N/A
If I search for "nude women" on Youtube, the search is not blocked. The same search is blocked on google.com for the reason: Blocked URL. Category: Banned Regular Expression URL.
If method 2 were working for "nude women", which reason would be listed? I assume it would be "Banned Search Words" as in method 1.
For method 2, my searchtermlimit is 10. This was built after your most recent commit where you set weightedphrasemode = 1
Thanks for your efforts.
Jason
After viewing the phrase lists for nudism, I tried searching for "nude beach" on youtube.com and the search was blocked for reason: Weighted search term limit exceeded.
So it is working after all; but apparently I'll need to tweak the lists. Maybe method 1 would be better after all.
Thanks again.
Jason
'Weighted search term limit exceeded.'
Assuming you are using the standard phrase lists the reason 'nude women' is not being
blocked is as follows:-
<nude><10> has a weight of 10
<women> does not appear in the list
So the total weight of 'nude women' is 10
This does not exceed your limit of 10 so it is allowed.
So, either, reduce searchtermlimit to less than 10
or increase the weighting on 'nude'
or insert a specific phrase for <nude women><11> with a higher rating.
I've tried with a searchtermlimit of 9 and all works as expected, Google and Youtube
searches for 'nude women' are blocked with 'Weighted search term exceeded'
The Blocked URL. Category: Banned Regular Expression URL. is not a search specific
function. It is a simplistic catch all regular expression which due to to way it's
constructed is not very effective as you can see as it worked with Google but not
youtube. In my view Banned Regular Expressions are best avoided as they tend to
give inconsistent results and often over block.
From: jho...@bruinmail.slcc.edu
To: "e2guardian" <e2gua...@googlegroups.com>
Sent: Monday, 14 December, 2015 7:56:56 PM
Subject: Re: I feel a bit discouraged