Skip to content

tolgap/scala-mnist

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

27 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Spark MNIST

The Spark implementation of an ANN running the MNIST dataset.

ANN

The used ANN is bgreeven's ANN implementation in Spark. It has yet to be merged into Spark-MLlib, but it's available as code for now.

Compilation

To compile the project to a .jar file, SBT (Simple Build Tool) is used. The build.sbt file contains project dependencies such as Spark and Hadoop. It also takes care of the Scala compiler downloading.

Usage

Acquire the MNIST dataset in a TSV format from here. Then, run the following commands:

cd spark-mnist
sbt package
cd target/scala-2.10
spark-submit --name "Spark MNIST NN" --master [local|yarn-cluster] --class Mnist mnist_1.0-2.10.jar {train} {test} {output}

For the master option, pass the corresponding master, either local to run it local, or yarn-cluster to run it on a Hadoop cluster.

About

Playground for Spark/Scala Mnist classification

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages