Language: Scala
Starred repositories
State of the Art Natural Language Processing
Elasticsearch plugin for nearest neighbor search. Store vectors and run similarity search using exact and approximate algorithms.
Free Elasticsearch security plugin and Kibana security plugin: super-easy Kibana multi-tenancy, Encryption, Authentication, Authorization, Auditing
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Experimental project to lay out basic algebra type classes
Provides a preprocessing platform for Lucene indexing and comprehensive Learning-to-Rank modules
An implementation of bisecting k-means clustering, a hierarchical clustering algorithm, on Spark
Coursera Machine Learning class examples in Spark