目前除了我們常見的搜索引擎如百度、Google、Sogou、360等搜索引擎之外,還存在其他非常多的搜索引擎,通常這些搜索引擎不僅不會(huì)帶來流量,因?yàn)榇罅康淖ト≌埱螅會(huì)造成主機(jī)的CPU和帶寬資源浪費(fèi),屏蔽方法也很簡單,按照下面步驟操作即可,原理就是分析指定UA然后屏蔽。
首先進(jìn)入寶塔面板,文件管理進(jìn)入 /www/server/nginx/conf 目錄,新建空白文件 kill_bot.conf。然后將以下代碼保存到當(dāng)前文件中。
#禁止垃圾搜索引擎蜘蛛抓取 if ($http_user_agent ~* "CheckMarkNetwork|Synapse|Nimbostratus-Bot|Dark|scraper|LMAO|Hakai|Gemini|Wappalyzer|masscan|crawler4j|Mappy|Center|eright|aiohttp|MauiBot|Crawler|researchscan|Dispatch|AlphaBot|Census|ips-agent|NetcraftSurveyAgent|ToutiaoSpider|EasyHttp|Iframely|sysscan|fasthttp|muhstik|DeuSu|mstshash|HTTP_Request|ExtLinksBot|package|SafeDNSBot|CPython|SiteExplorer|SSH|MegaIndex|BUbiNG|CCBot|NetTrack|Digincore|aiHitBot|SurdotlyBot|null|SemrushBot|Test|Copied|ltx71|Nmap|DotBot|AdsBot|InetURL|Pcore-HTTP|PocketParser|Wotbox|newspaper|DnyzBot|redback|PiplBot|SMTBot|WinHTTP|Auto Spider 1.0|GrabNet|TurnitinBot|Go-Ahead-Got-It|Download Demon|Go!Zilla|GetWeb!|GetRight|libwww-perl|Cliqzbot|MailChimp|SMTBot|Dataprovider|XoviBot|linkdexbot|SeznamBot|Qwantify|spbot|evc-batch|zgrab|Go-http-client|FeedDemon|JikeSpider|Indy Library|Alexa Toolbar|AskTbFXTV|AhrefsBot|CrawlDaddy|CoolpadWebkit|Java|UniversalFeedParser|ApacheBench|Microsoft URL Control|Swiftbot|ZmEu|jaunty|Python-urllib|lightDeckReports Bot|YYSpider|DigExt|YisouSpider|HttpClient|MJ12bot|EasouSpider|LinkpadBot|Ezooms") { return 403; break; } #禁止掃描工具客戶端 if ($http_user_agent ~* "crawl|curb|git|Wtrace|Scrapy" ) { return 403; break; } |
保存后返回到寶塔 - 【網(wǎng)站】-【設(shè)置】點(diǎn)擊左側(cè) 【配置文件】選項(xiàng)卡,在 #SSL-START SSL相關(guān)配置,請勿刪除或修改下一行帶注釋的404規(guī)則 上方空白行插入代碼: include kill_bot.conf; 保存后即可生效,這樣這些蜘蛛或工具掃描網(wǎng)站的時(shí)候就會(huì)提示403禁止訪問。