GitXplorerGitXplorer
x

julia-data-science-base-docker-img

public
13 stars
5 forks
0 issues

Commits

List of commits on branch master.
Verified
c2c85bea2a3c2f024674033dd86a2d783baec88a

Update README.md

xxiaodaigh committed 4 years ago
Verified
5e7df57249e63ebbf0e495f66b4a609b0a72b6f4

Update add-compiled.jl

xxiaodaigh committed 5 years ago
Verified
a35d6904daec9bd347c34a5698746274d093bb66

Update add-compiled.jl

xxiaodaigh committed 5 years ago
Verified
09259792da7d599646de3cd2c6acf5d23eadc7fe

Update compile.jl

xxiaodaigh committed 5 years ago
Verified
061fc4a91441f4fa4a4fdfa6b466e4e9c744e3de

Update add-compiled.jl

xxiaodaigh committed 5 years ago
Verified
6a2cb55edf3897c58a0555ef9a4e747ae24c81d5

Update add-non-compiled.jl

xxiaodaigh committed 5 years ago

README

The README file for this repository.

There is a now a better way to do this with PackageCompiler.jl

Intro

Julia Data Science Docker with data science packages compiled for instant loading!

Time-to-first-plot (TTFP) is often regarded as one of Julia's main pain points. The PackageCompiler.jl package can compile these package and alleviate the pain. It works by pre-"compiling" the packages and baking them into the julia sysimage so that using Pkg1 will be fast just like base packages.

This is an experimental first attempt at making data science packages used by me into a docker image with pre-compiled data science packages.

Usage

Firstly, install Docker. If you are running Windows, I recommend installing git so you have access to git bash.

On Windows you IP can be found using ipconfig and on Linux with ifconfig. This is needed if you wish to do plotting from the docker image.

Basic: Windows

docker run --rm \
	-e DISPLAY=YOUR_IP:0.0 \
	-e JUPYTER_ENABLE_LAB=yes \
	-v "$PWD":/home/jovyan/work\
	-it -p 8888:8888 \
	xiaodaidocker2019/julia-data-science-base

Often one may wish to save the data to somewhere on the hard drive, you may do this by attaching a local folder to the directory somedir.

Packages

The below packages are compiled using PackageCompiler.jl into the image

Package Type Notes
CategoricalArrays Foundation
Clustering Unsupervised learning
CSV Data IO
DataConvenience Data Manipulation/Convenience
DataFrames Data Manipulation
DataFramesMeta Data Manipulation
DecisionTree Supervised learning
FastGroupBy Data Manipulation/Convenience
Feather Data IO
FreqTables Foundation/Statistics
GLM Supervised learning
JDF Data IO For reading/writing JDF files
JLBoost Supervised learning
Lazy Data Manipulation/Convenience
Missings Foundation
Parquet Data IO ParquetFiles is quite broken at the moment
Plots Plotting
RDatasets Data
SortingLab Data Manipulation/Convenience
StatsBase Foundation/Statistics
StatsPlots Plotting
Tables Data Manipulation/Convenience
TableView Data Viewing
XGBoost Supervised learning

The below packages are included but not compiled

Package Type Notes
Pipe Data Manipulation/Convenience If compiled into base then there is warning message with Pipe
TableView Data Viewing If compiled then doesn't work with JupyterLab