Starred repositories
State of the Art Natural Language Processing
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Free Elasticsearch security plugin and Kibana security plugin: super-easy Kibana multi-tenancy, Encryption, Authentication, Authorization, Auditing
Experimental project to lay out basic algebra type classes
Elasticsearch plugin for nearest neighbor search. Store vectors and run similarity search using exact and approximate algorithms.
Coursera Machine Learning class examples in Spark
Provides a preprocessing platform for Lucene indexing and comprehensive Learning-to-Rank modules
An implementation of bisecting k-means clustering, a kind of hierarchical clustering algorithm, on Spark