jmdugan / blocklists
Shared lists of problem domains people may want to block with hosts files
License: Creative Commons Zero v1.0 Universal
I need to break the habit :D
First of all, thanks for providing these great lists. Of primary interest is the FB block list.
One thing I ran into: after adding the complete FB block list to my /etc/hosts file, I'm unable to use React developer tools or download the SDK bundle from https://connect.facebook.net/en_US/sdk.js
I'm looking for anyone who has run into this same issue and has tips or thoughts.
Is it enough to whitelist the SDK bundle URL listed above? Or are there other parts of the React.js framework that need to connect to other FB URLs?
Disclaimer: I'm not trying to use the FB website; I'm just a developer working with the React.js framework at work.
Thanks in advance for any insight into this issue.
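One way to try the whitelisting idea is to filter the block list before appending it to /etc/hosts. The following is only a minimal sketch: the function name `remove_allowed` is hypothetical, and whether allowing `connect.facebook.net` alone is enough for React tooling is exactly the open question above, not something this code answers.

```python
def remove_allowed(hosts_text, allowed):
    """Return hosts_text without entries that name an allowed hostname.

    Assumption: if a line lists several hostnames and any one of them is
    allowed, the whole line is dropped (a simplification for this sketch).
    """
    allowed = {name.lower() for name in allowed}
    kept = []
    for line in hosts_text.splitlines():
        # Ignore anything after '#', then split into address + hostnames.
        entry = line.split("#", 1)[0].split()
        # entry[0] is the address; entry[1:] are the hostnames it maps.
        if len(entry) >= 2 and any(h.lower() in allowed for h in entry[1:]):
            continue
        kept.append(line)
    return "\n".join(kept)


if __name__ == "__main__":
    blocklist = "0.0.0.0 connect.facebook.net\n0.0.0.0 graph.facebook.com"
    print(remove_allowed(blocklist, ["connect.facebook.net"]))
```

Comment lines and blank lines pass through unchanged, so the filtered output can still be pasted into a hosts file as-is.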
Hello,
Nice work!
I see these unblocked addresses in Pi-hole:
edge-mqtt.facebook.com
graph.facebook.com
graph.oculus.com
time.facebook.com
Could these be added?
Regards
The Google list could do with including domains used by Google's AMP project, including:
It was suggested to me over at GitLab that I pull in missing hosts for my new ZuckerBlock hosts-file. It includes Giphy hosts, as well as Oculus & Libra, and is CC0. So have a look, pull in what you need, and I'll do the same as we are both using the CC0 license. The repo is here: https://gitlab.com/intr0/zuckerblock. Peace.
Many of us who use Samsung phones with a lot of bloatware disabled would still love to have more control over blocking more Samsung domains which we cannot control by disabling apps.
Thank you!
I don't know if there is a reason these were left out, but I'd like to add:
iphone.facebook.com
mobile.facebook.com
touch.facebook.com
are there plans to include other sites that treat users as chattel? instagram, twit*, et al
Have you considered collaborating with http://someonewhocares.org/hosts/? They've got 12K+ /etc/hosts entries.
What do I need to remove to get Instagram working? (I already removed all Instagram-related hosts, but there are still some left.)
And if you know direct IP addresses for Instagram services, please list them, so people can use that list to allow traffic to them. :)
Thanks.
Hello everyone,
How did you find so many of Mark Zuckerberg's bad domains? There are more than 50. It looks really crazy that Mark Zuckerberg registered so many unwanted domains. That is why Mark Zuckerberg is a big holy shit man and you are a hero, because I am really hostile to Facebook.
How do I block the fucking advertising videos on YouTube? I really hate Facebook. I always argue to Mark Zuckerberg.
Because he is bad for the innocent world.
Thanks for ANTI FACEBOOK! I AM TOO
Amazon has domains under country-code TLDs such as the Netherlands (.nl), Germany (.de) and Italy (.it), and many more. A list of them is on this Wikipedia page: https://en.wikipedia.org/wiki/Amazon_(company)#Website Maybe add them to the Amazon list as well?
https://developers.facebook.com/docs/apps/test-pages/?_fb_noscript=1
I have your list in my hosts file and it works great, but I can still reach this page! I'm not a coder, so I don't know what to do to figure this out. Please help.
Thank you
For example, the computer file hosts is an operating system file that maps hostnames to IP addresses. It is a plain text file. Originally a file named HOSTS.TXT was manually maintained and made available via file sharing by Stanford Research Institute for the ARPANET membership, containing the hostname.
Would it be interesting to offer other links(search engine) to the hosts file?
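To make the hostname-to-address mapping described above concrete, here is a rough sketch of reading hosts-file text into a dictionary. The name `parse_hosts` is hypothetical, and the sketch assumes the first entry for a hostname wins, which may not match every resolver.

```python
def parse_hosts(text):
    """Map each hostname in hosts-file text to its address.

    Assumption: when a hostname appears more than once, the first
    entry is kept.
    """
    mapping = {}
    for line in text.splitlines():
        # Drop trailing comments, then split on whitespace.
        fields = line.split("#", 1)[0].split()
        if len(fields) < 2:
            continue  # blank line, comment, or address without a name
        address, names = fields[0], fields[1:]
        for name in names:
            mapping.setdefault(name.lower(), address)
    return mapping


if __name__ == "__main__":
    sample = "127.0.0.1 localhost\n0.0.0.0 graph.facebook.com  # blocked"
    for host, addr in parse_hosts(sample).items():
        print(host, "->", addr)
```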
Would it be possible to create (and auto-update) non-GitHub mirrors?
As it stands, GasMask on macOS is not able to read URLs that lead to the raw files on GitHub.
Example: SteveBlack also mirrors his hosts files on non-GitHub sites.
all1.txt is yours (sorted by domain for better readability).
all2.txt is yours with additions. It includes Instagram, CDNs, API and fb.me pages. It is also sorted by domain.
I've included yours, and sorted both for easier comparison (using Beyond Compare, for example).
I removed the www. prefix from all of my domains since it can always be added later.
I removed all WhatsApp domains (you can add them back) since they are too useful to lose.
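For anyone who wants to reproduce this kind of cleanup, the sketch below strips the www. prefix, removes duplicates, and sorts by domain, so that subdomains end up next to their parent. It is not the script used for the attached files; the function name `normalize` and the right-to-left label ordering are assumptions made for illustration.

```python
def normalize(domains):
    """Strip 'www.', de-duplicate, and sort so related domains are adjacent."""
    cleaned = set()
    for domain in domains:
        domain = domain.strip().lower()
        if domain.startswith("www."):
            domain = domain[len("www."):]
        if domain:
            cleaned.add(domain)
    # Compare labels right to left: com, facebook, graph ...
    return sorted(cleaned, key=lambda d: d.split(".")[::-1])


if __name__ == "__main__":
    for name in normalize(["www.facebook.com", "fb.me", "graph.facebook.com"]):
        print(name)
```

Sorting two lists this way before diffing them keeps the comparison stable regardless of the order the entries were collected in.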
Wouldn't it be a better method to create a facebook block list with something like:
whois -h whois.radb.net '!gAS32934' | tr ' ' ', '
as described here: https://www.perpetual-beta.org/weblog/blocking-facebook-on-os-x.html
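For reference, the same lookup can be sketched in Python. Note that the result is a set of IP prefixes, which would feed a firewall rule rather than a hosts file, since hosts files only map names. The function names are hypothetical, and the exact framing of the server reply is an assumption: the parser simply keeps every whitespace-separated token that is a valid network and ignores the rest.

```python
import ipaddress
import socket


def fetch_origin_routes(asn, host="whois.radb.net", port=43, timeout=10):
    """Send the '!g<ASN>' query from the command above and return the raw reply.

    Assumption: the server closes the connection after answering one query.
    """
    with socket.create_connection((host, port), timeout=timeout) as sock:
        sock.sendall(f"!g{asn}\r\n".encode("ascii"))
        chunks = []
        while True:
            data = sock.recv(4096)
            if not data:
                break
            chunks.append(data)
    return b"".join(chunks).decode("ascii", errors="replace")


def parse_prefixes(response):
    """Keep only the tokens in the reply that parse as IP networks."""
    prefixes = []
    for token in response.split():
        try:
            prefixes.append(ipaddress.ip_network(token, strict=False))
        except ValueError:
            continue  # framing lines or anything else that is not a prefix
    return prefixes


if __name__ == "__main__":
    # Requires network access; AS32934 is the ASN from the command above.
    for prefix in parse_prefixes(fetch_origin_routes("AS32934")):
        print(prefix)
```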
For https://github.com/jmdugan/blocklists/tree/master/corporations/pinterest :
I've used https://github.com/eladkarako/SubDomains-with-VirusTotal-and-SecurityTrails-API which queries VirusTotal for subdomains and domain siblings,
and https://github.com/eladkarako/sortjs (nodejs) with 'sort__extraction_rule__url_upsidedown_domain_order.cmd' to make it easy to compare your list with the new one.
Note that the 'www.' prefix can be added to anything. I've removed it since my own HOSTS generator https://github.com/eladkarako/hosts/blob/master/_builder.js#L98-L121 (nodejs) does a good job of adding it automatically.
I was citing sources earlier today and I have noticed that Nestlé is doing a massive amount of search engine manipulation, even on DuckDuckGo. There were many different domains and it made it really hard to find any criticism about them. I am not sure if Nestlé is using trackers, or if a blocklist is necessary for this project, but they have hundreds of different domains they are using to try and suppress their criticism. Out of 7 different searches and over 60 pages of results, I was only able to find 16 pages not by them. Judging by this, it seems a bit sketchy. Would a Nestlé blocklist suit this project?
I would also be interested in blocking TikTok.
This is still used: https://mail.thefacebook.com
I am so sorry if I'm not supposed to ask this here. Similar queries are not uncommon in git repos and it's definitely relevant to your work, so I'm hoping I'm not being rude or bothersome, but...
Is there any hope of building a blocklist for domains that facilitate stalkerware (like Flexispy, mSpy, AutoForward Spy, etc)? And are they using CDNs like Cloudflare for distribution & delivery or worse...leveraging TLS & other security solutions, like Google's gstatic, gvt1, gvt2...to hide their traffic?
Amazon seems like a strange omission from a blocklist that for some reason even includes Mozilla
FYI, https://github.com/tbds/FreeContributor is now a 404.
Hi
from README.md
Yes. If you want that solution, see dnsmasq and projects like these: FreeContributor.
The FreeContributor link is a 404 and a Brave Search doesn't find it: https://search.brave.com/search?q=FreeContributor
Wiped from Internet?
Any alternative?
0.0.0.0 bg-bg.facebook.com
0.0.0.0 bn-in.facebook.com
0.0.0.0 bs-ba.facebook.com
0.0.0.0 business.facebook.com
0.0.0.0 ca-es.facebook.com
0.0.0.0 da-dk.facebook.com
0.0.0.0 developers.facebook.com
0.0.0.0 el-gr.facebook.com
0.0.0.0 error.facebook.com
0.0.0.0 es-es.facebook.com
0.0.0.0 es-la.facebook.com
0.0.0.0 fa-ir.facebook.com
0.0.0.0 fburl.com
0.0.0.0 fi-fi.facebook.com
0.0.0.0 fr-ca.facebook.com
0.0.0.0 fr-fr.facebook.com
0.0.0.0 graph.facebook.com
0.0.0.0 gu-in.facebook.com
0.0.0.0 hi-in.facebook.com
0.0.0.0 hr-hr.facebook.com
0.0.0.0 id-id.facebook.com
0.0.0.0 streaming-graph.facebook.com
0.0.0.0 ta-in.facebook.com
0.0.0.0 te-in.facebook.com
0.0.0.0 th-th.facebook.com
0.0.0.0 upload.facebook.com
0.0.0.0 ur-pk.facebook.com
0.0.0.0 vi-vn.facebook.com
0.0.0.0 secure.facebook.com
0.0.0.0 scontent-bom1-1.xx.fbcdn.net
0.0.0.0 scontent.xx.fbcdn.net
0.0.0.0 our.intern.facebook.com
0.0.0.0 pa-in.facebook.com
0.0.0.0 phabricator.intern.facebook.com
Every day Cisco Umbrella/OpenDNS releases the top 1 million DNS queries they receive. Shuffling through the list finds a big chunk of additional subdomains based on real-world traffic.
http://s3-us-west-1.amazonaws.com/umbrella-static/index.html
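As a rough illustration of how that list could be mined, the sketch below pulls every name at or under a given parent domain out of the CSV rows. It assumes each row has the form `rank,domain`, and the function name `subdomains_of` is hypothetical. Downloading and unzipping the list is left out.

```python
import csv


def subdomains_of(rows, parent):
    """Return the sorted names in `rows` that equal or end with .parent.

    `rows` is any iterable of CSV lines, assumed to be 'rank,domain'.
    """
    parent = parent.lower().rstrip(".")
    suffix = "." + parent
    found = set()
    for row in csv.reader(rows):
        if len(row) < 2:
            continue  # skip blank or malformed rows
        name = row[1].strip().lower()
        # Match on a label boundary so 'notfacebook.com' is not included.
        if name == parent or name.endswith(suffix):
            found.add(name)
    return sorted(found)


if __name__ == "__main__":
    sample = ["1,google.com", "2,graph.facebook.com", "3,notfacebook.com"]
    print(subdomains_of(sample, "facebook.com"))
```

Matching on the leading dot avoids false positives from unrelated domains that merely end with the same letters.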
duck.com has been transferred to DuckDuckGo. Please remove the entry from the Google lists.
giphy is missing in the facebook dataset.
I've removed just the ones where 'whats' appears in the domain name.
all_facebook__except_for_domains_with_whatsapp_in_their_name.txt
This was done the same way as in #65.
I've also removed www. prefix domains since the prefix can be added to every line anyway.
Facebook, Inc. changed its name to Meta. The folder name should be changed, since one might think that "Facebook" means only the social networking platform.
Hi there,
This is awesome work! How did you manage to collect these domains? I did it by scraping Facebook's source code with a simple Python script. What about you?
Could you clarify why walmart.com is on the Microsoft blocklist?
There should be an addition of the domains for Threads by Meta Platforms (formerly Facebook, Inc) in these https://github.com/jmdugan/blocklists/tree/master/corporations/facebook lists.
rescuemetrics.com
15.197.227.255
3.33.254.229
Hi @jmdugan, I added your great hosts file, with blocking for Facebook and other domains, to https://github.com/FadeMind/hosts.extras via commit FadeMind/hosts.extras@a7ae9f6
Regards, FadeMind
Thanks for your work!
CC @StevenBlack
Please remove
I've had very limited time and attention to devote to basic maintenance, issue handling, etc.
Some issues are now a year or more old, and keeping up requires 10-20 minutes a month on a regular basis to update and reply to things.
Please reply in this thread if you'd like to be added to the project to help.
Thank you!
Jonathan
I appended the contents of this file to my /etc/hosts file. The behaviour changed as expected after flushing the DNS cache, but upon rebooting the behaviour reverted back to unblocked facebook domains. However, the /etc/hosts file still contains the changes. Does anyone have any suggestions on how to approach this?
blocklists/corporations/facebook/all
Line 6 in 0c08bd0