[go: nahoru, domu]

Skip to content

A new arguably faster implementation of Apache Spark from scratch in Rust

License

Notifications You must be signed in to change notification settings

hugcoday/native_spark

 
 

Repository files navigation

native_spark

Join the chat at https://gitter.im/fast_spark/community Build Status License

Documentation

A new, arguably faster, implementation of Apache Spark from scratch in Rust. WIP

Framework tested only on Linux, requires nightly Rust. Read how to get started in the documentation.

ToDo

  • Error Handling(Priority)
  • Fault tolerance

RDD

Most of these except file reader and writer are trivial to implement

  • map
  • flat_map
  • filter
  • group_by
  • reduce_by
  • distinct
  • count
  • take_sample
  • union
  • glom
  • cartesian
  • pipe
  • map_partitions
  • for_each
  • collect
  • reduce
  • fold
  • aggregate
  • take
  • first
  • sample
  • zip
  • save_as_text_file (can save only as text file in executors local file system)

Config Files

  • Replace hard coded values

About

A new arguably faster implementation of Apache Spark from scratch in Rust

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Rust 98.7%
  • Other 1.3%