Group: Debatty

Debatty Mark
Implementation of various string similarity and distance algorithms: Levenshtein, Jaro-Winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity...
Last Release on May 12, 2020
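
To make the first algorithm in that list concrete, here is a minimal sketch of the classic dynamic-programming Levenshtein distance. It is an illustration only: the class and method names (`LevenshteinSketch`, `distance`) are hypothetical and are not taken from the library, whose actual API may differ.

```java
// Hypothetical illustration; not the library's own implementation or API.
class LevenshteinSketch {

    // Returns the minimum number of single-character insertions,
    // deletions, and substitutions needed to turn a into b.
    static int distance(String a, String b) {
        int[] prev = new int[b.length() + 1];
        int[] curr = new int[b.length() + 1];

        // Transforming the empty prefix of a into b[0..j) takes j insertions.
        for (int j = 0; j <= b.length(); j++) {
            prev[j] = j;
        }

        for (int i = 1; i <= a.length(); i++) {
            // Transforming a[0..i) into the empty string takes i deletions.
            curr[0] = i;
            for (int j = 1; j <= b.length(); j++) {
                int cost = (a.charAt(i - 1) == b.charAt(j - 1)) ? 0 : 1;
                int insertion = curr[j - 1] + 1;
                int deletion = prev[j] + 1;
                int substitution = prev[j - 1] + cost;
                curr[j] = Math.min(Math.min(insertion, deletion), substitution);
            }
            // Reuse the two rows instead of allocating a full matrix.
            int[] tmp = prev;
            prev = curr;
            curr = tmp;
        }
        return prev[b.length()];
    }

    public static void main(String[] args) {
        System.out.println(distance("kitten", "sitting")); // prints 3
    }
}
```

Keeping only two rows holds memory at O(|b|) while the running time stays O(|a| * |b|).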

3. Java LSH (11 usages)

info.debatty » java-lsh MIT

A Java implementation of Locality Sensitive Hashing (LSH)
Last Release on Jun 24, 2019

4. Java SpamSum (4 usages)

info.debatty » java-spamsum MIT

A Java implementation of SpamSum / SSDeep / Context Triggered Piecewise Hashing
Last Release on Aug 11, 2016

Implementation of aggregation operators such as WA, OWA, and WOWA
Last Release on Oct 29, 2020
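
As a rough illustration of two of the operators named above, the sketch below computes a weighted average (WA) and an ordered weighted average (OWA). The class and method names are hypothetical and do not come from the library. The sketch assumes the weights are non-negative and sum to 1, and it does not cover WOWA.

```java
import java.util.Arrays;

// Hypothetical illustration; not the library's own implementation or API.
class OwaSketch {

    // WA: weight i is attached to the i-th input (its source or position).
    static double weightedAverage(double[] values, double[] weights) {
        checkLengths(values, weights);
        double result = 0.0;
        for (int i = 0; i < values.length; i++) {
            result += weights[i] * values[i];
        }
        return result;
    }

    // OWA: weight i is attached to the i-th largest input, so the
    // weights express a preference for high or low values.
    static double owa(double[] values, double[] weights) {
        checkLengths(values, weights);
        double[] sorted = values.clone(); // leave the caller's array untouched
        Arrays.sort(sorted);              // ascending order
        int n = sorted.length;
        double result = 0.0;
        for (int i = 0; i < n; i++) {
            result += weights[i] * sorted[n - 1 - i]; // walk from largest to smallest
        }
        return result;
    }

    private static void checkLengths(double[] values, double[] weights) {
        if (values.length != weights.length) {
            throw new IllegalArgumentException("values and weights must have the same length");
        }
    }

    public static void main(String[] args) {
        double[] values = {1.0, 3.0, 2.0};
        double[] weights = {0.5, 0.25, 0.25};
        System.out.println(weightedAverage(values, weights)); // 1.75
        System.out.println(owa(values, weights));             // 2.25
    }
}
```

With weights (1, 0, 0) the OWA reduces to the maximum, and with (0, 0, 1) to the minimum, which is why it is often described as interpolating between the two.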

6. Java Graphs (4 usages)

info.debatty » java-graphs MIT

Algorithms that build k-nearest neighbors graph (k-nn graph): Brute-force, NN-Descent,...
Last Release on Nov 23, 2017

7. Java Datasets (3 usages)

info.debatty » java-datasets MIT

Java library for parsing various datasets: ENRON email dataset, Wikipedia web pages, DBLP papers, Reuters news ...
Last Release on Sep 7, 2017

8. Spark Kmedoids (1 usage)

info.debatty » spark-kmedoids MIT

Spark implementation of various k-medoids clustering algorithms.
Last Release on Apr 17, 2018

9. Jinu

info.debatty » jinu MIT

Easy evaluation and comparison of Java algorithms.
Last Release on Dec 4, 2017

10. Spark KNN Graphs

info.debatty » spark-knn-graphs MIT

Spark algorithms for building k-nn graphs
Last Release on Sep 16, 2017

11. Spark Nndescent

info.debatty » spark-nndescent MIT

Spark implementation of the NN-Descent algorithm for building k-nn graphs, based on the paper "Efficient K-Nearest Neighbor Graph Construction for Generic Similarity Measures" by Dong et al.
Last Release on May 18, 2015

Maven plugin to publish on sparkpackages.org
Last Release on Jun 3, 2017

Maven plugin to publish on sparkpackages.org
Last Release on Dec 3, 2015