colly 使用多个收集器 ##8 - 走看看

zoukankan html css js c++ java

colly 使用多个收集器 ##8
使用多个收集器

如果任务足够复杂或具有不同类型的子任务，建议为一个抓取工作使用多个收集器。coursera course scraper就是一个很好的例子，它使用了两个收集器——一个解析列表视图并处理分页，另一个收集课程的详细信息。

注意：使用收集器。ID在调试中区分不同的收集器

克隆收集器

如果收集器具有类似的配置，可以使用收集器的Clone()方法。Clone()复制具有相同配置但没有附加回调的收集器。
c := colly.NewCollector( colly.UserAgent("myUserAgent"), colly.AllowedDomains("foo.com", "bar.com"), ) // Custom User-Agent and allowed domains are cloned to c2 c2 := c.Clone()
　　

在收集器之间传递自定义数据

使用collector的Request()函数可以与其他收集器共享上下文。

共享上下文的例子:
c.OnResponse(func(r *colly.Response) { r.Ctx.Put(r.Headers.Get("Custom-Header")) c2.Request("GET", "https://foo.com/", nil, r.Ctx, nil) })
　　
查看全文

相关阅读:
Python 安装Twisted 提示python version 2.7 required,which was not found in the registry
Openfire Strophe开发中文乱码问题
 css div 垂直居中
 How to create custom methods for use in spring security expression language annotations
How to check “hasRole” in Java Code with Spring Security?
Android 显示/隐藏应用图标
 Android 当媒体变更后，通知其他应用重新扫描
 文件上传那些事儿
 专题：点滴Javascript
主流动画实现方式总结

原文地址：https://www.cnblogs.com/liujie-php/p/11571117.html

Copyright © 2011-2022 走看看