GitXplorerGitXplorer
L

Seattle-House-Sales-Weather-Predictions

public
0 stars
0 forks
0 issues

Commits

List of commits on branch master.
Verified
43c09cf043b42caec0c08a00c24950169f42d0ab

Update README.md

LLikithaVemulapalli committed 4 years ago
Verified
037ac4003e1aa255f9b14340c226f30ac7f263be

Update README.md

LLikithaVemulapalli committed 4 years ago
Unverified
a969a9e7b156ae0c3aa7cd4065ac3cd9e73bd322

First Commit

LLikithaVemulapalli committed 4 years ago
Unverified
63a37c72b94c92db3bc4139d3d746d9100e335f0

First commit

LLikithaVemulapalli committed 4 years ago
Unverified
dfa87620f37a644cb3f2463c8ad67d86300875d3

First Commit

LLikithaVemulapalli committed 4 years ago

README

The README file for this repository.

Data Exploration

For this project, I have considered House sales data of King County, Seattle and Seattle weather datasets from Kaggle.

House Sales: https://www.kaggle.com/harlfoxem/housesalesprediction

Seattle Weather : https://www.kaggle.com/rtatman/did-it-rain-in-seattle-19482017

Steps to execute:

  1. Download the files from the github repository.
  2. Unzip the Data_Science_Project1.zip file to get the kc_house_data.csv and Seattle weather data file.
  3. Place the csv files in data folder and place the data folder in notebooks folder. The notebooks folder should also have ipynb file as well.
  4. Navigate to terminal and type "jupyter notebook"
  5. Navigate to the folder where the notebook is placed.
  6. From the menu icon cell, click on Run all which will run the whole notebook from the first cell. Verify the results.

This project depicts how weather impacts the house sales in Seattle.

Travelling the world on a mission to discover new data

  1. Set up a data science project structure in a new git repository in your GitHub account
  2. Install Jupyter notebook prerequisites (Anaconda, Python, etc.)
  3. Select an industry
  4. Select two to three public data sets from that industry
  5. Load the data sets into panda data frames following the 10 minutes to pandas guide
  6. Formulate one or two ideas on how the data sets could be combined to establish additional value using exploratory data analysis
  7. Transform the data sets into a single data set while following data preparation processes to clean and transform features (use pandas documentation for help)
  8. Document your process and results
  9. Commit your notebook, source code, visualizations and other supporting files to the git repository in GitHub