
Space-Mice-Gene-Expression-Analysis-Feature-Identification-and-Anomaly-Detection


Commits

List of commits on branch main.
Verified
2d3e8c2486f5d4b44f5ff83d795f2b571da84d68

Update README.md

aarpi5191 committed a month ago
Verified
2280331df9c1fc324b8c754bcc753b39e329270f

Update README.md

aarpi5191 committed a month ago
Verified
a03a822c29d1f5bbb12dff636327473437cfa4f7

Add files via upload

aarpi5191 committed a month ago
Verified
fd894bfbe008b696e2a937f6873bf6548c35b42f

Update README.md

aarpi5191 committed a year ago
Verified
25a568acbdf59db8d1fcb738df37b8bbd679d2ca

Rename PCA (13).ipynb to PCA.ipynb

aarpi5191 committed a year ago
Verified
17c02ac0c3b58adc185d95abbb5cfc5a99e25ba8

Add files via upload

aarpi5191 committed a year ago

README

The README file for this repository.

Space Mice Gene Expression Analysis: Feature Identification and Anomaly Detection

This project applies Principal Component Analysis (PCA) to a NASA dataset to identify the features that contribute most to variation in mouse gene expression. The PCA is also implemented as a machine learning pipeline using the Python package Ibis. In addition, the Isolation Forest algorithm is applied to the dataset to detect anomalies (potential outliers).
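
As a rough illustration of the PCA step, the sketch below runs PCA on a small synthetic expression matrix using NumPy only. It is not the code from PCA.ipynb: the data, the group sizes, the shifted genes, and the helper name `pca` are all assumptions made for the example, and the real data.csv layout may differ.

```python
import numpy as np


def pca(X, n_components=2):
    """PCA via SVD. Returns (scores, components, explained_variance_ratio)."""
    Xc = X - X.mean(axis=0)                      # center each gene (column)
    U, S, Vt = np.linalg.svd(Xc, full_matrices=False)
    variances = S**2 / (X.shape[0] - 1)          # variance captured by each component
    ratio = variances / variances.sum()
    scores = U[:, :n_components] * S[:n_components]  # sample coordinates on the PCs
    return scores, Vt[:n_components], ratio[:n_components]


# Synthetic stand-in for the real data: rows are samples, columns are genes.
rng = np.random.default_rng(0)
n_per_group, n_genes = 10, 50
ground = rng.normal(0.0, 1.0, size=(n_per_group, n_genes))
flight = rng.normal(0.0, 1.0, size=(n_per_group, n_genes))
flight[:, :5] += 4.0                             # assumption: 5 genes shifted in flight samples
X = np.vstack([ground, flight])
condition = np.array(["ground"] * n_per_group + ["flight"] * n_per_group)

scores, components, ratio = pca(X, n_components=2)

# Genes with the largest absolute loading on PC1 drive most of the variation.
top_genes = np.argsort(np.abs(components[0]))[::-1][:5]

print("explained variance ratio:", ratio)
print("mean PC1 score, ground:", scores[condition == "ground", 0].mean())
print("mean PC1 score, flight:", scores[condition == "flight", 0].mean())
print("top PC1 genes (column indices):", top_genes)
```

Comparing the PC scores across a metadata column (here the hypothetical `condition` array) is one simple way to see which factor a component tracks.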

Instructions:

  1. Download the metadata.csv and data.csv files from the NASA website at https://osdr.nasa.gov/bio/repo/data/studies/OSD-665 and upload them to a notebook environment such as Jupyter Notebook or Google Colab.
  2. Download the PCA.ipynb notebook and upload it to the same environment. Open the notebook and run it to perform Principal Component Analysis (PCA) on the dataset. The analysis identifies the most significant factors (condition, library preparation (libprep), and strain) that contribute to gene expression variation between mice in space and on the ground.
  3. Download the Ibis_PCA.py script and upload it to the same environment. Run "python Ibis_PCA.py" to perform the PCA through a machine learning pipeline built with the Ibis package.
  4. Download the IsolationForestAlgorithm.ipynb notebook and upload it to the same environment. Open the notebook and run it to apply the Isolation Forest algorithm to the dataset and identify anomalies.
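
For orientation, the anomaly detection in step 4 can be sketched with scikit-learn's `IsolationForest`. This is a minimal example on synthetic data, not the contents of IsolationForestAlgorithm.ipynb: the sample counts, the injected outliers, and the `contamination` value are assumptions chosen for illustration.

```python
import numpy as np
from sklearn.ensemble import IsolationForest

# Synthetic stand-in for the expression data: rows are samples, columns are features.
rng = np.random.default_rng(0)
n_inliers, n_outliers, n_features = 100, 5, 10
inliers = rng.normal(0.0, 1.0, size=(n_inliers, n_features))
# Assumption: outliers sit far from the bulk, each pushed toward a random corner.
outliers = rng.normal(0.0, 1.0, size=(n_outliers, n_features)) + rng.choice(
    [-8.0, 8.0], size=(n_outliers, n_features)
)
X = np.vstack([inliers, outliers])

# contamination is the expected fraction of anomalies; 0.05 is a guess for this toy data.
model = IsolationForest(n_estimators=200, contamination=0.05, random_state=0)
labels = model.fit_predict(X)       # -1 marks an anomaly, 1 marks a normal sample
scores = model.score_samples(X)     # lower score means more anomalous

flagged = np.flatnonzero(labels == -1)
print("flagged sample indices:", flagged)
print("mean score, inliers:", scores[:n_inliers].mean())
print("mean score, injected outliers:", scores[n_inliers:].mean())
```

On real data the true fraction of anomalies is unknown, so flagged samples are best treated as candidates for inspection, and the result should be checked for sensitivity to `contamination`.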