jar

io.github.jasperroel : SiteCrawler

Maven & Gradle

Jul 30, 2018
22 stars

SiteCrawler · This project provides a simple WebCrawler with retry-capabilities, functionality to distinguish between http/https sites. It biggest feature is that it allows for plugins (or CrawlerActions), which allows you to hook your scripts into the crawling process. It also allow for setting "blocked" URLs. Those URLs or patterns will not be crawled.

Table Of Contents

Latest Version

Download io.github.jasperroel : SiteCrawler JAR file - Latest Versions:

All Versions

Download io.github.jasperroel : SiteCrawler JAR file - All Versions:

Version Vulnerabilities Size Updated
1.0.x

View Java Class Source Code in JAR file

  1. Download JD-GUI to open JAR file and explore Java source code file (.class .java)
  2. Click menu "File → Open File..." or just drag-and-drop the JAR file in the JD-GUI window SiteCrawler-1.0.0.jar file.
    Once you open a JAR file, all the java classes in the JAR file will be displayed.

com.salesforce.webdev.sitecrawler.navigation

├─ com.salesforce.webdev.sitecrawler.navigation.NavigateThread.class - [JAR]

├─ com.salesforce.webdev.sitecrawler.navigation.NavigateThreadException.class - [JAR]

├─ com.salesforce.webdev.sitecrawler.navigation.ProcessPage.class - [JAR]

com.salesforce.webdev.sitecrawler.webclient

├─ com.salesforce.webdev.sitecrawler.webclient.WebClientExtended.class - [JAR]

├─ com.salesforce.webdev.sitecrawler.webclient.WebClientFactory.class - [JAR]

├─ com.salesforce.webdev.sitecrawler.webclient.WebClientPool.class - [JAR]

├─ com.salesforce.webdev.sitecrawler.webclient.WebClientPoolClosedException.class - [JAR]

com.salesforce.webdev.sitecrawler

├─ com.salesforce.webdev.sitecrawler.SiteCrawler.class - [JAR]

├─ com.salesforce.webdev.sitecrawler.SiteCrawlerAction.class - [JAR]

├─ com.salesforce.webdev.sitecrawler.SiteCrawlerErrorCodes.class - [JAR]

com.salesforce.webdev.sitecrawler.utils

├─ com.salesforce.webdev.sitecrawler.utils.EmptyJavascriptEngine.class - [JAR]

├─ com.salesforce.webdev.sitecrawler.utils.NamedThreadFactory.class - [JAR]

├─ com.salesforce.webdev.sitecrawler.utils.URLCleaner.class - [JAR]

├─ com.salesforce.webdev.sitecrawler.utils.URLNormalizer.class - [JAR]

com.salesforce.webdev.sitecrawler.beans

├─ com.salesforce.webdev.sitecrawler.beans.CrawlProgress.class - [JAR]

├─ com.salesforce.webdev.sitecrawler.beans.CrawlerConfiguration.class - [JAR]

Advertisement

Dependencies from Group

Jul 30, 2018
22 stars

Discover Dependencies

Sep 26, 2018
Feb 27, 2023
1 usages
110 stars
Jan 27, 2020
1 usages
Aug 15, 2023
2 usages
0 stars
Mar 09, 2023
2 usages
Aug 02, 2018
Jan 19, 2023
3 usages
12k stars
mln
Jul 14, 2020
1 usages
Aug 06, 2018
1 usages
3 stars