W3C_annual_most_used_survey.../README.md

32 lines
3.0 KiB
Markdown
Raw Normal View History

2020-02-20 10:40:40 +01:00
SOURCES
This is a minimial blocklist based on the W3Tech annual surveys on most used webtechnology. In stead of crawling the web analyzing
Alexa's TOP 500.000 websites, I looked at the web industry itself, to block the bulk of the mainstream advertising services with the
obvious benefit of low maintenance effort (for me) and low performance impact (for the adblocker you are using).
2020-01-23 12:25:40 +01:00
2020-02-20 10:43:24 +01:00
WHY USE A MINIMAL BLOCKLIST?
2020-02-20 09:10:34 +01:00
In advertising the number one (Google) has a marketshare of around 40 percent, Facebook the number two hits the 20 percent mark
2020-02-20 09:34:45 +01:00
while the number three Comscore just has a little over 2.5% marketshare. For comparison Amazon with its huge webstore generates
2020-02-20 09:34:18 +01:00
about the same advertising traffic on its own website.
2020-01-23 12:25:40 +01:00
2020-02-20 10:41:43 +01:00
Number 200 on this list only has a marginal market share of 0.1 % 1n the Alexa top 10000. Imagine how effective URL number 50.000 will
2020-02-20 10:27:04 +01:00
be of your blocklist. Fair chance it will never be triggered. Some adblockers have an option to exclude URL's with low usage (like
AdGuard 'use optimized filters' option) or periodically filter out dead URL's (Opera's and Brave's build-in adblockers).
2020-02-20 09:50:18 +01:00
2020-02-20 10:27:04 +01:00
Opera performed an analysis on the effectiveness of Easylist and reported in a blog that 20 percent of the listed URL were dead and
another 60% had a 'hit rate' of less than 1% of all the traffic of the Opera browser users (having agreed to provide telemetry data).
2020-02-20 10:45:58 +01:00
Scientific studies show that large blocklist are (a little) more effective on 'long tail' websites (not in Alexa top 500.000), but
less effective on Alexa top top 500000 (websites with more than 2000+ visitors per day), because common blocklist (like Easylist)
often trigger adblock walls and anti-adblock measures or even help to identify you (see for instance https://browserleaks.com/proxy).
2020-02-20 09:50:18 +01:00
2020-02-20 10:10:14 +01:00
ALTERNATIVES
2020-02-20 10:32:24 +01:00
When you want a common use well maintained small blocklist, use Disconnect's common Ad-filter or Peter Low's blocklist (over 3000
2020-02-20 10:34:33 +01:00
blacklisted URL's). When you want a well maintained medium sized blocklist, use Steven Black's blocklist (more than 50.000 URL's
blacklisted). When you want a large blocklist have a look at the 'ultimate' blocklist of Energized.pro (over 700K URL's blacklisted).
2020-02-20 09:50:18 +01:00
2020-02-20 10:10:14 +01:00
ERRORS & ISSUES
2020-02-20 09:50:18 +01:00
When you still want to use this blocklist, feel free to use it and provide feedback on errors. You can post issues, but I will only have
2020-02-20 11:02:26 +01:00
a look at it when the issue causes a problem on an COM, INF, NET or ORG domain. Most adblocker-extensions with an option to add your own
custom filyters (e.g. AdBlockPlus, Adguard, uBlockOrigin) have an option to correct problems in filters with 'badfilter' option (look
for UserFilter or MyFilter). When this filter causes a problem on website ABC_example.com, simply add a badfilter using this syntax:
||ABC_example.com$badfilter