WebCollector-2.52

A Java crawler for information collection

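Statistics
Project name: WebCollector
Project homepage: https://github.com/CrawlScript/WebCollector
Organization:
License: GPL2.0
Repository: Central
Library: WebCollector
Tags:
Version information
  • Current version: 2.52
  • Release time: 2017-06-03 01:46:46
  • File size: 96.1 KB
SHA-1 checksum: 22c65e13912ab42b7ca07382127b77e5b03b263c
Index time
  • First version:
  • Latest version: 2018-07-22 14:50:09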
Total versions: 21
Referencing libraries: 201
Dependency libraries: 11
Downloads
Sources: Apache Maven repository, Repo1 (recommended), Repo2, Aliyun repository
Jar: WebCollector-2.52.jar
Source jar: WebCollector-2.52-sources.jar
Developers
Id     Name    Email                      Website
hujun  Hu Jun  hujunxianligong@gmail.com
Dependencies
GroupId                            ArtifactID            Version        Scope
org.jsoup                          jsoup                 1.9.2
com.googlecode.juniversalchardet   juniversalchardet     1.0.3
junit                              junit                 4.11
org.json                           json                  20140107
com.sleepycat                      je                    5.0.73
org.slf4j                          slf4j-api             1.7.21
org.slf4j                          slf4j-log4j12         1.7.21
org.seleniumhq.selenium            selenium-java         2.44.0         provided
mysql                              mysql-connector-java  5.1.40
org.springframework                spring-jdbc           4.3.5.RELEASE
commons-dbcp                       commons-dbcp          1.4
License
Name Homepage
GPL2.0 http://www.gnu.org/licenses/gpl-2.0.html
Included Files
Name
cn.edu.hfut.dmic.contentextractor.ContentExtractor.class
cn.edu.hfut.dmic.contentextractor.News.class
cn.edu.hfut.dmic.webcollector.crawldb.DBManager.class
cn.edu.hfut.dmic.webcollector.crawldb.Generator.class
cn.edu.hfut.dmic.webcollector.crawldb.Injector.class
cn.edu.hfut.dmic.webcollector.crawldb.SegmentWriter.class
cn.edu.hfut.dmic.webcollector.crawler.AutoParseCrawler.class
cn.edu.hfut.dmic.webcollector.crawler.Crawler.class
cn.edu.hfut.dmic.webcollector.example.DemoBingCrawler.class
cn.edu.hfut.dmic.webcollector.example.DemoDepthCrawler.class
cn.edu.hfut.dmic.webcollector.example.DemoHashSetNextFilter.class
cn.edu.hfut.dmic.webcollector.example.DemoMetaCrawler.class
cn.edu.hfut.dmic.webcollector.example.DemoNextFilter.class
cn.edu.hfut.dmic.webcollector.example.DemoPostCrawler.class
cn.edu.hfut.dmic.webcollector.example.DemoSelenium.class
cn.edu.hfut.dmic.webcollector.example.DemoTypeCrawler.class
cn.edu.hfut.dmic.webcollector.example.TutorialCrawler.class
cn.edu.hfut.dmic.webcollector.fetcher.Executor.class
cn.edu.hfut.dmic.webcollector.fetcher.Fetcher.class
cn.edu.hfut.dmic.webcollector.fetcher.NextFilter.class
cn.edu.hfut.dmic.webcollector.fetcher.Visitor.class
cn.edu.hfut.dmic.webcollector.model.CrawlDatum.class
cn.edu.hfut.dmic.webcollector.model.CrawlDatums.class
cn.edu.hfut.dmic.webcollector.model.Links.class
cn.edu.hfut.dmic.webcollector.model.Page.class
cn.edu.hfut.dmic.webcollector.net.HttpRequest.class
cn.edu.hfut.dmic.webcollector.net.HttpResponse.class
cn.edu.hfut.dmic.webcollector.net.Proxys.class
cn.edu.hfut.dmic.webcollector.net.Requester.class
cn.edu.hfut.dmic.webcollector.plugin.berkeley.BerkeleyCrawler.class
cn.edu.hfut.dmic.webcollector.plugin.berkeley.BerkeleyDBManager.class
cn.edu.hfut.dmic.webcollector.plugin.berkeley.BerkeleyDBReader.class
cn.edu.hfut.dmic.webcollector.plugin.berkeley.BerkeleyDBUtils.class
cn.edu.hfut.dmic.webcollector.plugin.berkeley.BerkeleyGenerator.class
cn.edu.hfut.dmic.webcollector.plugin.berkeley.BreadthCrawler.class
cn.edu.hfut.dmic.webcollector.plugin.nextfilter.HashSetNextFilter.class
cn.edu.hfut.dmic.webcollector.plugin.ram.RamCrawler.class
cn.edu.hfut.dmic.webcollector.plugin.ram.RamDB.class
cn.edu.hfut.dmic.webcollector.plugin.ram.RamDBManager.class
cn.edu.hfut.dmic.webcollector.plugin.ram.RamGenerator.class
cn.edu.hfut.dmic.webcollector.util.CharsetDetector.class
cn.edu.hfut.dmic.webcollector.util.Config.class
cn.edu.hfut.dmic.webcollector.util.Counter.class
cn.edu.hfut.dmic.webcollector.util.CrawlDatumFormater.class
cn.edu.hfut.dmic.webcollector.util.FileSystemOutput.class
cn.edu.hfut.dmic.webcollector.util.FileUtils.class
cn.edu.hfut.dmic.webcollector.util.JsoupUtils.class
cn.edu.hfut.dmic.webcollector.util.MysqlHelper.class
cn.edu.hfut.dmic.webcollector.util.RegexRule.class