Webtrench

public
25 stars
5 forks
3 issues

Commits

List of commits on branch main.
Verified
10728e849dd6f481e77f76cb0ae332e91b3c1e81

Update README.md

nnuhmanpk committed 2 years ago
Verified
e211e0f9736b9633e0badf6d442c5bdaa38cf66d

Update README.md

nnuhmanpk committed 2 years ago
Verified
84cfbe7fc2234b891dda01db00963246d92f85f7

Update README.md

nnuhmanpk committed 2 years ago
Unverified
3e75dc09659ad527156e766b10bfed7e2e388db5

Changes in function.Short name for less complexity

nnuhmanpk committed 2 years ago
Verified
66be181c679d7795dddb7c96baed37852094a0f8

Update requirements.txt

nnuhmanpk committed 2 years ago
Unverified
2898440cb47da06820420a98929061c58b483e8d

Removed old requirements , Added new

nnuhmanpk committed 2 years ago

README

The README file for this repository.

Webtrench

WebTrench provides a comprehensive and powerful toolkit for web scraping. Whether you're working on a machine learning project, conducting research, or simply need to gather data from the web, WebTrench is the perfect tool for the job. So why wait? Start using WebTrench today and streamline your data collection process!

pip install Webtrench

Check Documentation Here



Why WebTrench

  • Easy to use: With its simple and intuitive interface, WebTrench makes it easy to extract data from the web.
  • Comprehensive: WebTrench includes functions for extracting a wide range of data, from images to tables and beyond.
  • Fast and efficient: WebTrench is designed to be fast and efficient, so you can quickly gather the data you need.
  • Suitable for a variety of use cases: Whether you're working on a machine learning project, conducting research, or simply need to gather data from the web, WebTrench is a versatile tool that can meet your needs.

from Webtrench import ImageScrapper

url = 'https://example.com'   # page to scrape for images
folder_path = './images'      # directory where the downloaded images are saved
ImageScrapper.all_image_from_url(url, folder_path)

This code snippet downloads the images found on the page at https://example.com and saves them in the ./images folder, using a random number as each file name.
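
To give a rough idea of what collecting images from a page involves, here is a minimal standard-library sketch. It is not WebTrench's implementation: the names ImageSrcCollector and extract_image_urls are hypothetical, and the sketch only finds image URLs in an HTML string without downloading anything.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class ImageSrcCollector(HTMLParser):
    """Hypothetical helper: records the src attribute of every <img> tag."""

    def __init__(self):
        super().__init__()
        self.sources = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs for the current tag.
        if tag == "img":
            src = dict(attrs).get("src")
            if src:
                self.sources.append(src)


def extract_image_urls(html, base_url):
    """Return absolute image URLs found in html, resolved against base_url."""
    collector = ImageSrcCollector()
    collector.feed(html)
    # urljoin turns relative paths such as "/logo.png" into absolute URLs
    # and leaves URLs that are already absolute unchanged.
    return [urljoin(base_url, src) for src in collector.sources]


if __name__ == "__main__":
    page = '<body><img src="/logo.png"><img src="https://cdn.example.com/a.jpg"></body>'
    for image_url in extract_image_urls(page, "https://example.com"):
        print(image_url)
```

A scraper would then fetch each of these URLs and write the bytes to the target folder. How WebTrench does this internally is not described here.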

Limitations of WebTrench

  • Depends on website structure: The success of web scraping with WebTrench depends on the structure of the website being scraped. If the website's structure changes, WebTrench may not work as expected.
  • Legal restrictions: There may be legal restrictions on the use of web scraping, so it's important to familiarize yourself with the laws in your jurisdiction before using WebTrench.

Privacy Policy

WebTrench respects the privacy of its users and is committed to protecting their data. We do not collect or store any personal information, and all data collected through the use of WebTrench is kept confidential.

Web Scraping Ethics

When using WebTrench or any other web scraping tool, it's important to follow ethical guidelines and avoid scraping websites without the owner's permission. This includes websites that explicitly prohibit scraping, as well as websites that contain sensitive or confidential information.
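
One common way to check whether a site asks crawlers to stay away from a path is its robots.txt file. The sketch below uses Python's standard urllib.robotparser module and is independent of WebTrench. The function name is_allowed and the user agent "my-scraper" are assumptions made for illustration, and a robots.txt check is only one signal: it does not replace reading a site's terms of use or asking the owner for permission.

```python
from urllib.robotparser import RobotFileParser


def is_allowed(robots_txt, user_agent, url):
    """Return True if the given robots.txt text permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() accepts the lines of an already-downloaded robots.txt file,
    # so this helper works without making a network request itself.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # Example rules, written inline for illustration only.
    rules = "User-agent: *\nDisallow: /private/\n"
    print(is_allowed(rules, "my-scraper", "https://example.com/images/"))
    print(is_allowed(rules, "my-scraper", "https://example.com/private/data.html"))
```

In practice you would download the site's robots.txt first (RobotFileParser also offers set_url() and read() for that) and skip any URL for which the check returns False.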

Legal Warning

The use of web scraping may be subject to legal restrictions, and the legality of web scraping depends on the jurisdiction in which it is being used. Before using WebTrench, it's important to familiarize yourself with the laws in your jurisdiction and ensure that your use of the tool complies with all applicable laws. WebTrench cannot be held responsible for any illegal use of the tool.

Contributing Guide

We welcome contributions from the community! If you are interested in contributing to the WebTrench project, here are some guidelines to get started:

  • Check the issues page to see if there are any open bugs or features that you would like to work on.
  • Fork the repository and make your changes in a separate branch.
  • Once you have made your changes, submit a pull request for review.
  • The project maintainers will review your pull request and provide feedback. If necessary, make any requested changes and resubmit your pull request.
  • Once your pull request is approved and merged, you will become a contributor to the WebTrench project!

Project Clone Guide

If you would like to clone the WebTrench repository, follow these steps:

  • Install Git on your computer.
  • Open a terminal window and navigate to the directory where you would like to clone the repository.
  • Run the following command: git clone https://github.com/nuhmanpk/WebTrench.git
  • The repository will be cloned to your computer, and you can now make changes to the code and contribute to the project.

Reminder

Please note that WebTrench is currently in the pre-release stage and is not yet finished. If you encounter any issues, please check the issues page, or consider contributing to make a better version of WebTrench!