putaodoudou / cn-malicious-website-list

This project forked from zzhihao2017/cn-malicious-website-list

本列表收录互联网上常见的恶意网站网址。This list contains URLs of malicious websites commonly found on the Internet.

License: GNU General Public License v3.0

cn-malicious-website-list's Introduction

CN-Malicious-website-list 互联网恶意网站列表

本列表收录互联网上常见的恶意网站网址。This list contains URLs of malicious websites commonly found on the Internet.

列表架构

核心列表

本部分列表转引自国内外较为权威的互联网恶意网站列表。由于本部分列表较为权威,不建议广大网友针对本部分列表进行审查。目前转引的主要列表有:

MWSL工作室:OneDNS恶意广告和恶意网址拦截规则(http://www.mwsl.org.cn)
360网站安全:最近恶意网站列表(http://webscan.360.cn/url)
莆田系网站列表(https://github.com/hustcc/PTHospital.chrome)
DNS-sinkhole 恶意网站列表(http://malc0de.com/bl/)
DNS-BH 恶意软件域过滤清单(http://www.malwaredomains.com/wordpress/?page_id=66)

扩展列表

本部分列表为作者@zzhjim自行收集,主要供作者自用,因此有如下特点:

错误较多: 未经过系统的审查,可能出现许多错误屏蔽的网站
频繁更新: 作者日常使用本列表,一有时间,就会更新
有一定的个人倾向性: 作者@zzhjim对一些网站(如百度系、2345系、腾讯电脑管家)存在不满,因此基本予以全站封锁
评判标准不同: 与“核心列表”、“核心扩展列表”有所不同,“扩展列表”对于网页的要求更加严格(详见下文)

“恶意网站”评判标准

本系列列表所定义的“恶意网站”包括但不限于含有较多以下内容网站:

“病毒木马网站”

1、被挂马
2、含有自动(或诱导)下载流氓软件/木马病毒的下载器
3、自动(或诱导)下载冒充为正常软件的流氓软件/病毒木马
4、存在其他被主流网页检测工具报告的病毒/木马行为

“冒牌和钓鱼网站”

1、与知名网站存在域名、功能、样式、外观、图标、名称之一相似,且易导致混淆的
2、存在钓鱼行为、恶意窃取用户隐私的
3、存在恶意跳转行为的
4、搜索引擎显示的名称与实际名称严重不符,易导致使用者迷惑的

“体验较差网站”(一般仅适用于“扩展列表”)

1、网页广告众多、难以采取常规手段拦截,且严重影响视觉体验的
2、网页广告难以采用常规手段拦截,且有大量与有效内容相混淆的内容的
3、原创内容极少,网页价值较低的
4、网站或网站主的其他项目存在恶意窃取用户隐私、严重影响用户权益的行为的
5、对部分国内流氓网站进行的无理由全面封锁
6、网站或网站主的其他网站存在大量标题党新闻、恶意虚假新闻的
7、网站弹出窗口过度,或存在严重影响用户体验的弹出窗口、提示条的
8、由于网站自身原因,加载过于缓慢,且影响用户计算机操作流畅性的

“过度SEO优化网站”(一般仅适用于“扩展列表”,部分适用于“核心列表”)

1、网页恶意跳转到主站,或其他网站的
2、存在堆砌关键词、恶意提高搜索引擎排名、导致搜索引擎搜索到无关内容、恶意转载大量的与网站不相关文章、网页名称中包含无关内容造成使用者迷惑等行为的
3、宣传恶意SEO优化方法的
4、垃圾农场、信息农场等堆砌大量无来源、垃圾内容的
5、恶意堆砌大量内容的导航网站
6、恶意抓取第三方版权内容,进行手动或自动跳转,且效率极差,或网页价值偏低的

“国内流氓软件及网站”(一般仅适用于“扩展列表”,部分适用于“核心列表”)

1、“2345”系列:所有域名全面封禁(如2345导航、多特软件站等)
2、“腾讯电脑管家”:“扩展列表”中执行全面封禁
3、“驱动精灵”、“驱动大师”子频道:选择性封禁
4、百度全家桶系列:“扩展列表”中对“百度卫士”“百度乐彩”“百度游戏”“百度软件”等进行选择性封禁
5、360系列:对部分存在恶意行为的进行选择性封禁

违法、反动网站

1、宣扬**、赌博、毒品的
2、宣扬犯罪行为的
3、宣扬儿童色情的
4、以违法信息为诱饵传播病毒和恶意程序的
5、以违法、反动内容为诱饵传播其他非法信息的
6、存在大量有上述内容的广告、弹出页面、子网站、友情链接的

List structure

Core list

This part of the list is drawn from relatively authoritative lists of malicious websites published in China and abroad.

Because these sources are relatively authoritative, users are not encouraged to review this part of the list themselves.

The main lists currently drawn on are:

MWSL Studio: OneDNS malicious advertising and malicious URL blocking rules (http://www.mwsl.org.cn)

360 Website Security: list of recent malicious websites (http://webscan.360.cn/url)

Putian-affiliated websites list (https://github.com/hustcc/PTHospital.chrome)

DNS-sinkhole malicious website list (http://malc0de.com/bl/)

DNS-BH malware domain blocklist (http://www.malwaredomains.com/wordpress/?page_id=66)
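Combining several upstream lists like the ones above can be sketched as follows. This is only an illustration, not the project's actual tooling: it assumes each source can be reduced to plain text with either one domain per line or hosts-style lines such as `0.0.0.0 bad.example`, and the function names `parse_domains` and `merge_lists` are hypothetical.

```python
def parse_domains(text):
    """Collect domain names from hosts-style or one-domain-per-line text.

    Assumed input format (not specified by this README): '#' starts a
    comment, and the domain is the last whitespace-separated field.
    """
    domains = set()
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and padding
        if not line:
            continue
        # "0.0.0.0 bad.example" -> "bad.example"; "bad.example" stays as-is
        domain = line.split()[-1].lower().rstrip(".")
        if "." in domain:
            domains.add(domain)
    return domains


def merge_lists(*texts):
    """Merge any number of list texts into one sorted, de-duplicated list."""
    merged = set()
    for text in texts:
        merged |= parse_domains(text)
    return sorted(merged)
```

Lower-casing and stripping the trailing dot means that the same domain listed by two sources in slightly different spellings is kept only once.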

Extended list

This part of the list is collected by the author, @zzhjim, mainly for personal use, so it has the following characteristics:

More errors: the list has not been systematically reviewed, so many websites may be blocked by mistake.

Frequent updates: the author uses this list daily and updates it whenever time allows.

Some personal bias: the author, @zzhjim, is dissatisfied with certain websites (such as the Baidu family, the 2345 family, and Tencent PC Manager) and therefore blocks them more or less site-wide.

Different criteria: unlike the "core list" and the "core extended list", the "extended list" holds web pages to a stricter standard (see below).

"Malicious website" criteria

The "malicious websites" defined in this series of lists include, but are not limited to, websites that contain a substantial amount of the following:

"Virus and Trojan websites"

  1. The site has been compromised to serve a Trojan (drive-by download).

  2. It hosts a downloader that automatically downloads rogue software or Trojans/viruses, or induces the user to do so.

  3. It automatically downloads (or induces the user to download) rogue software or viruses/Trojans disguised as legitimate software.

  4. It shows other virus or Trojan behavior reported by mainstream web scanning tools.

"Fake and phishing websites"

  1. It resembles a well-known website in any one of domain name, functionality, style, appearance, icon, or name, and is easily confused with it.

  2. It engages in phishing or maliciously steals users' private data.

  3. It performs malicious redirects.

  4. The name shown by search engines differs greatly from the site's actual name, which easily confuses users.
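The domain-name part of the first criterion can be approximated mechanically. The sketch below is only one possible heuristic, not how this list is actually compiled: it uses Python's standard `difflib.SequenceMatcher` to flag a domain that is very close to, but not identical with, a known domain. The function name `looks_like`, the 0.85 threshold, and the sample domains are all assumptions made for illustration.

```python
from difflib import SequenceMatcher


def looks_like(candidate, known_domains, threshold=0.85):
    """Return the known domains that `candidate` closely resembles.

    An exact match is the genuine site, so it is never reported.
    The threshold is an arbitrary illustrative value.
    """
    candidate = candidate.lower().rstrip(".")
    hits = []
    for known in known_domains:
        known = known.lower().rstrip(".")
        if candidate == known:
            continue  # identical domain: not a lookalike
        ratio = SequenceMatcher(None, candidate, known).ratio()
        if ratio >= threshold:
            hits.append(known)
    return hits
```

A heuristic like this only produces candidates for manual review. The other factors in the criterion (functionality, appearance, icon, name) still need a human to judge.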

"Poor-experience websites" (generally applies only to the "extended list")

  1. The pages carry numerous advertisements that are hard to block by conventional means and that seriously harm the visual experience.

  2. The advertisements are hard to block by conventional means, and a large amount of material is mixed in with, and easily mistaken for, the real content.

  3. There is very little original content, and the pages are of low value.

  4. Other projects of the website or its owner maliciously steal users' private data or seriously harm users' rights and interests.

  5. Blanket blocking, without stated reason, of certain domestic rogue websites.

  6. The website, or other websites run by its owner, carries large amounts of clickbait news or malicious fake news.

  7. The website shows excessive pop-up windows, or pop-ups and notification bars that seriously harm the user experience.

  8. For reasons attributable to the website itself, it loads too slowly and degrades the responsiveness of the user's computer.

"Excessive SEO websites" (generally applies only to the "extended list"; partly applies to the "core list")

  1. Pages maliciously redirect to the main site or to other websites.

  2. It engages in keyword stuffing, malicious manipulation of search engine rankings, causing search engines to return irrelevant content, mass reposting of articles unrelated to the site, putting irrelevant content in page titles to confuse users, or similar behavior.

  3. It promotes malicious SEO techniques.

  4. Content farms and information farms stuffed with large amounts of unsourced, junk content.

  5. Navigation (link directory) websites maliciously stuffed with large amounts of content.

  6. It maliciously scrapes third-party copyrighted content and redirects manually or automatically, while being very inefficient or offering low page value.

"Domestic rogue software and websites" (generally applies only to the "extended list"; partly applies to the "core list")

  1. "2345" series: all domains are fully blocked (e.g. 2345 Navigation (2345导航), the Duote software site (多特软件站), etc.)

  2. "Tencent PC Manager" (腾讯电脑管家): fully blocked in the "extended list".

  3. "Driver Genius" (驱动精灵) and "Driver Master" (驱动大师) sub-channels: selectively blocked.

  4. Baidu "family bucket" (bundled software) series: in the "extended list", "Baidu Guard" (百度卫士), "Baidu Lecai" (百度乐彩), "Baidu Games" (百度游戏), "Baidu Software" (百度软件), and similar products are selectively blocked.

  5. 360 series: the parts that show malicious behavior are selectively blocked.
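The difference between a full block and a selective block can be sketched as a hostname check. This is a minimal illustration, assuming a full-block entry covers the listed domain and all of its subdomains while a selective entry covers only the exact hostname listed. The README does not define the matching rules, and the function name `is_blocked` and the `.example` domains are hypothetical.

```python
def is_blocked(hostname, full_block, selective_block):
    """Check a hostname against whole-site and selective block entries.

    full_block:      domains blocked together with every subdomain
    selective_block: hostnames blocked only on an exact match
    (Both semantics are assumptions made for this sketch.)
    """
    host = hostname.lower().rstrip(".")
    if host in selective_block:
        return True
    labels = host.split(".")
    # Test the host itself and each parent domain, stopping before the
    # bare top-level label so that e.g. "example" alone is never tested.
    for i in range(len(labels) - 1):
        if ".".join(labels[i:]) in full_block:
            return True
    return False


# Hypothetical entries, for illustration only.
full = {"rogue.example"}
selective = {"games.portal.example"}
print(is_blocked("www.rogue.example", full, selective))  # True
print(is_blocked("portal.example", full, selective))     # False
```

With these sets, every host under `rogue.example` is blocked, while only the `games` sub-channel of `portal.example` is.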

Illegal and reactionary websites

  1. Promoting cults, gambling, or drugs.

  2. Promoting criminal behavior.

  3. Promoting child pornography.

  4. Spreading viruses and malicious programs using illegal information as bait.

  5. Spreading other illegal information using illegal or reactionary content as bait.

  6. Carrying large numbers of advertisements, pop-up pages, sub-sites, or partner links that contain the content described above.

cn-malicious-website-list's People

Contributors

zzhihao2017
