GitXplorerGitXplorer
n

WebScraper

public
2 stars
1 forks
0 issues

Commits

List of commits on branch master.
Unverified
6dd2f5741963c5cbcf3b9ee8f8bb2ac39372732b

Run by docker information

nnikitap492 committed 8 years ago
Unverified
e55f2f49cc07c2c3697c49c27caefbfb31027407

Integration with docker

nnikitap492 committed 8 years ago
Unverified
c1c12f44d1b7497478b0360ac03e5a874a03168b

Added Dockerfile

nnikitap492 committed 8 years ago
Unverified
1f1fd25dfd19bfbd23f16f6369595b23104117e7

Entrypoint script

nnikitap492 committed 8 years ago
Unverified
85489ed4e98ad6f73af0cd2b9f647135b2d963e1

Add fatJar task

nnikitap492 committed 8 years ago
Unverified
f966b041ae63fe20654ab08eccd56a4e30d5a1a7

Fix nulls

nnikitap492 committed 8 years ago

README

The README file for this repository.

Build Status codecov.io CodeFactor

WebScraper

Simple application for data scraping.

How to try this project?

You must have gradle and jdk 8. In project directory create jar by command gradle build. When jar will be created in build/libs. Run it java -jar scraper.jar uri words options .

Or just docker. Run docker run --rm nikitap/webscraper uri words options

URI this is web site url or path to file with URLs, words are searchable words separated by delimeter "," and options enables different type of analyze:

  • -c for counting chars on web pages
  • -w for counting words on web pages
  • -e for scrapping sentences for searchable words