W3C_annual_most_used_survey.../README.md

35 lines
3.1 KiB
Markdown
Raw Normal View History

2020-02-20 10:40:40 +01:00
SOURCES
This is a minimial blocklist based on the W3Tech annual surveys on most used webtechnology. In stead of crawling the web analyzing
Alexa's TOP 500.000 websites, I looked at the web industry itself, to block the bulk of the mainstream advertising services with the
obvious benefit of low maintenance effort (for me) and low performance impact (for the adblocker you are using).
2020-01-23 12:25:40 +01:00
2020-02-20 10:43:24 +01:00
WHY USE A MINIMAL BLOCKLIST?
2020-02-20 09:10:34 +01:00
In advertising the number one (Google) has a marketshare of around 40 percent, Facebook the number two hits the 20 percent mark
2020-02-20 09:34:45 +01:00
while the number three Comscore just has a little over 2.5% marketshare. For comparison Amazon with its huge webstore generates
2020-02-20 09:34:18 +01:00
about the same advertising traffic on its own website.
2020-01-23 12:25:40 +01:00
2020-02-20 10:41:43 +01:00
Number 200 on this list only has a marginal market share of 0.1 % 1n the Alexa top 10000. Imagine how effective URL number 50.000 will
2020-02-20 10:27:04 +01:00
be of your blocklist. Fair chance it will never be triggered. Some adblockers have an option to exclude URL's with low usage (like
AdGuard 'use optimized filters' option) or periodically filter out dead URL's (Opera's and Brave's build-in adblockers).
2020-02-20 09:50:18 +01:00
2020-02-20 10:27:04 +01:00
Opera performed an analysis on the effectiveness of Easylist and reported in a blog that 20 percent of the listed URL were dead and
another 60% had a 'hit rate' of less than 1% of all the traffic of the Opera browser users (having agreed to provide telemetry data).
2020-02-20 13:00:30 +01:00
Scientific studies show that large blocklist are a little more effective on 'long tail' websites (not in Alexa top 500.000 with more
than 2000 unique visitors per day), but a little less effective on Alexa top top 300000 (websites with more than 3000+ visitors per
day). Websites with many visitors are more likely to install adblock walls to protect advertising income. Anti-adblock walls often
check for popular content filters (like Easylist), see for instance https://browserleaks.com/proxy
2020-02-20 09:50:18 +01:00
2020-02-20 10:10:14 +01:00
ALTERNATIVES
2020-02-20 12:47:35 +01:00
When you want a common use well maintained small blocklist, use Disconnect's Simple Ad-filter or Peter Low's blocklist (both have over
2020-02-20 12:48:08 +01:00
3000 blacklisted URL's). Disconnect filter is also used by Firefox and Edge anti-tracking. When you want a well maintained medium sized
2020-02-20 12:47:35 +01:00
blocklist, use Steven Black's blocklist (more than 50.000 URL's blacklisted). When you want a large blocklist have a look at the
'ultimate' blocklist of Energized.pro (over 700K URL's blacklisted). Benefit of Energized is that they rempve dead URL's.
2020-02-20 09:50:18 +01:00
2020-02-20 10:10:14 +01:00
ERRORS & ISSUES
2020-02-20 09:50:18 +01:00
When you still want to use this blocklist, feel free to use it and provide feedback on errors. You can post issues, but I will only have
2020-02-20 11:05:14 +01:00
a look at it when the issue causes a problem on an COM, INF, NET or ORG domain. You can fix a problem easily yourself in most adblock
extensions (e.g. AdBlockPlus, Adguard, uBlockOrigin). Look for UserFilter or MyFilter in the extension options. For instance when this
filter causes a problem on website ABC_example.com, simply add a badfilter using this syntax:
2020-02-20 11:05:46 +01:00
2020-02-20 11:02:26 +01:00
||ABC_example.com$badfilter