GitXplorerGitXplorer
c

Financial-Info-Extractor

public
21 stars
6 forks
2 issues

Commits

List of commits on branch master.
Verified
108f230ac5a8cc82064a2ab93b91c36cd44467a0

Update README.md

cchaitanyarahalkar committed 7 years ago
Verified
eb06a51f693e4d62bee080bcfaf56435408796f6

Create LICENSE.md

cchaitanyarahalkar committed 7 years ago
Verified
d8c30a14b5a58b930d48866d931dd5e8f443cb4c

Update README.md

cchaitanyarahalkar committed 7 years ago
Verified
492883d6276ca86f8d1efd8fe8d279c7f18c28bc

Update README.md

cchaitanyarahalkar committed 7 years ago
Verified
a44c149c81e7baf328285bb828d3fc54cb74de94

Update extract.py

cchaitanyarahalkar committed 7 years ago
Verified
02ec2e50c134e6d192fd38b53c0521db68c84f26

Update README.md

cchaitanyarahalkar committed 7 years ago

README

The README file for this repository.

Finance Information Extractor

This is a python script that allows one to extract financial data for any company from Religare(India's leading diversified financial services group). This is written compliant only to the Indian National Stock Exchange since no official API provides this information for the Indian stock market. This script downloads the financial data for the top 500 NSE companies. This is an information scraper written with the Beautiful Soup package for Python.

Following information is extracted for the company

  • Balance Sheet
  • Cashflow
  • Quarterly Earnings
  • Half Earnings
  • Key-Ratio
  • Profit-Loss
(In Consolidated and Standalone type)
(All the data is scraped and stored in .csv format)

Dependencies and Tools

Steps Involved

  • Extraction of the URLs for each company using browser automation. PhantomJS(Headless Browser) is being used for the browser automation part. (Any other browser can also be used)
  • Scraping the data from each URL and storing in CSVs.

Install the dependencies and run the scripts.

Install Beautiful Soup 4,Selenium and Requests using pip.

$ pip install bs4
$ pip install requests
$ pip install selenium

Else download manually here:

Run the scripts:

Add the path to the Browser Driver in the in the url-extractor script.

$ python url-extractor.py
$ python extract.py

License

The MIT License (MIT)

Copyright (c) 2018 Chaitanya Rahalkar