添加nginx防止爬虫爬取配置 (#1187)
* 添加nginx防止爬虫爬取配置 * Update nginx.conf * Update README.md
This commit is contained in:
parent
d598dc65ce
commit
89f78bd4c7
16
README.md
16
README.md
|
@ -24,6 +24,7 @@
|
||||||
- [Docker 参数示例](#docker-参数示例)
|
- [Docker 参数示例](#docker-参数示例)
|
||||||
- [Docker build \& Run](#docker-build--run)
|
- [Docker build \& Run](#docker-build--run)
|
||||||
- [Docker compose](#docker-compose)
|
- [Docker compose](#docker-compose)
|
||||||
|
- [防止爬虫抓取](#防止爬虫抓取)
|
||||||
- [使用 Railway 部署](#使用-railway-部署)
|
- [使用 Railway 部署](#使用-railway-部署)
|
||||||
- [Railway 环境变量](#railway-环境变量)
|
- [Railway 环境变量](#railway-环境变量)
|
||||||
- [手动打包](#手动打包)
|
- [手动打包](#手动打包)
|
||||||
|
@ -234,6 +235,21 @@ services:
|
||||||
```
|
```
|
||||||
- `OPENAI_API_BASE_URL` 可选,设置 `OPENAI_API_KEY` 时可用
|
- `OPENAI_API_BASE_URL` 可选,设置 `OPENAI_API_KEY` 时可用
|
||||||
- `OPENAI_API_MODEL` 可选,设置 `OPENAI_API_KEY` 时可用
|
- `OPENAI_API_MODEL` 可选,设置 `OPENAI_API_KEY` 时可用
|
||||||
|
|
||||||
|
#### 防止爬虫抓取
|
||||||
|
|
||||||
|
**nginx**
|
||||||
|
|
||||||
|
将下面配置填入nginx配置文件中,可以参考 `docker-compose/nginx/nginx.conf` 文件中添加反爬虫的方法
|
||||||
|
|
||||||
|
```
|
||||||
|
# 防止爬虫抓取
|
||||||
|
if ($http_user_agent ~* "360Spider|JikeSpider|Spider|spider|bot|Bot|2345Explorer|curl|wget|webZIP|qihoobot|Baiduspider|Googlebot|Googlebot-Mobile|Googlebot-Image|Mediapartners-Google|Adsbot-Google|Feedfetcher-Google|Yahoo! Slurp|Yahoo! Slurp China|YoudaoBot|Sosospider|Sogou spider|Sogou web spider|MSNBot|ia_archiver|Tomato Bot|NSPlayer|bingbot")
|
||||||
|
{
|
||||||
|
return 403;
|
||||||
|
}
|
||||||
|
```
|
||||||
|
|
||||||
### 使用 Railway 部署
|
### 使用 Railway 部署
|
||||||
|
|
||||||
[![Deploy on Railway](https://railway.app/button.svg)](https://railway.app/new/template/yytmgc)
|
[![Deploy on Railway](https://railway.app/button.svg)](https://railway.app/new/template/yytmgc)
|
||||||
|
|
|
@ -3,6 +3,13 @@ server {
|
||||||
server_name localhost;
|
server_name localhost;
|
||||||
charset utf-8;
|
charset utf-8;
|
||||||
error_page 500 502 503 504 /50x.html;
|
error_page 500 502 503 504 /50x.html;
|
||||||
|
|
||||||
|
# 防止爬虫抓取
|
||||||
|
if ($http_user_agent ~* "360Spider|JikeSpider|Spider|spider|bot|Bot|2345Explorer|curl|wget|webZIP|qihoobot|Baiduspider|Googlebot|Googlebot-Mobile|Googlebot-Image|Mediapartners-Google|Adsbot-Google|Feedfetcher-Google|Yahoo! Slurp|Yahoo! Slurp China|YoudaoBot|Sosospider|Sogou spider|Sogou web spider|MSNBot|ia_archiver|Tomato Bot|NSPlayer|bingbot")
|
||||||
|
{
|
||||||
|
return 403;
|
||||||
|
}
|
||||||
|
|
||||||
location / {
|
location / {
|
||||||
root /usr/share/nginx/html;
|
root /usr/share/nginx/html;
|
||||||
try_files $uri /index.html;
|
try_files $uri /index.html;
|
||||||
|
|
Loading…
Reference in New Issue