GitXplorerGitXplorer
m

spark-jython-udf

public
6 stars
1 forks
0 issues

Commits

List of commits on branch master.
Unverified
5441013b170f51e47253efbca9ca54a08ba759c0

Improved readme

mmariusvniekerk committed 8 years ago
Unverified
f5818a375de5709876f9894b179b452ca20e9162

Update README.md

mmariusvniekerk committed 8 years ago
Unverified
5a680d842ad8bfacf9fd959558681bc05e24eb8e

Update .travis.yml

mmariusvniekerk committed 8 years ago
Unverified
9525a38ff61b2755260d6b638e5bd0364892cd50

Update .travis.yml

mmariusvniekerk committed 8 years ago
Unverified
b2df48b0df8a2297fe87b78aa67fdb194c6a96ae

Made some more pieces transient.

mmariusvniekerk committed 8 years ago
Unverified
31017818c75ffa45bb1db123bc0fe54dea40dbfc

Added missing import

mmariusvniekerk committed 8 years ago

README

The README file for this repository.

Build Status

spark-jython-udf

This is an initial attempt at a spark package that captures the core concepts from SPARK-15369 and attempts to turn that into an installable spark-package.

Thanks

This would not have been possible without the considerable efforts of @holdenk pushing python forward in the Apache Spark community

Usage

In a python instance that already has a spark context instantiated and the spark-jython library loaded.

import spark_jython
from pyspark.sql.types import *

def jythonfn(arg1):
    return arg1.split(" ")

returnUDFType = ArrayType(StringType())
jythonUDF = spark_session.catalog.registerJythonFunction("name", function, returnUDFType)