Spark MNIST

The Spark implementation of an ANN running the MNIST dataset.

ANN

The used ANN is bgreeven's ANN implementation in Spark. It has yet to be merged into Spark-MLlib, but it's available as code for now.

Compilation

To compile the project to a .jar file, SBT (Simple Build Tool) is used. The build.sbt file contains project dependencies such as Spark and Hadoop. It also takes care of the Scala compiler downloading.

Usage

Acquire the MNIST dataset in a TSV format from here. Then, run the following commands:

cd spark-mnist
sbt package
cd target/scala-2.10
spark-submit --name "Spark MNIST NN" --master [local|yarn-cluster] --class Mnist mnist_1.0-2.10.jar {train} {test} {output}

For the master option, pass the corresponding master, either local to run it local, or yarn-cluster to run it on a Hadoop cluster.

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
project		project
src		src
.gitignore		.gitignore
README.md		README.md
build.sbt		build.sbt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Spark MNIST

ANN

Compilation

Usage

About

Releases

Packages

Languages

tolgap/scala-mnist

Folders and files

Latest commit

History

Repository files navigation

Spark MNIST

ANN

Compilation

Usage

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages