Find near duplicates on your computer

Algorithms of this site

Near Duplicates Search

Hashing N-dimensional Float Vectors

Understanding image similarity

Facebook Image Similarity Challenge and its winners (2021).

Similarity Search at Flickr

Deep Features as a Perceptual Metric [PDF]

Sentence Transformers to Find (Near)-Duplicates (Github)

Multi-View Image Comparison [PDF]

Image Similarity Algorithm Based on Sparse Coding

Human perception

Gestalt Principles Overview

Gestalt Theory

Gestalt Perceptual Hierarchies

SSIM and metrics

Understanding SSIM (NVIDIA 2020)

From Error Visibility to Structural Similarity [PDF]

Better than SSIM (NVIDIA)

Code for PSNR and SSIM

SSIM in Tensorflow

SSIM in JavaScript

Image Similarity Metrics

PyTorch Image Quality Metrics (Github)

Embeddings

Building a Reverse Image Search Engine (Oreilly e-book)

Embeddings for Detecting Mobile Counterfeit Apps [PDF]

Similarity Metrics for Embeddings

Hyperbolic Image Embeddings [PDF]

Image clustering

Sentence Transformers for Clustering (Github)

Vector search

Vector Similarity Search Overview

Google's Vector Search

k-Nearest Neighbor (k-NN) Search on Amazon AWS

Accelerating Similarity Search with Vector Indexing

Hashing

Image Hash Functions to Find Duplicates

A Survey of Hashing Methods [PDF]

Near-Optimal Hashing Algorithms for Approximate Nearest Neighbor in High Dimensions [PDF]

Apple NeuralHash

Comparison of Perceptual Hashes [PDF]

Deep learning

CNNs and Triplet Loss for Product Search Well Explained [PDF]

Semantic-Aware Image Similarity Search [PDF]

Pretrained CNN Features For Similar Image Search

TensorFlow Similarity (Python)

Other

Multiresolution Hash Encoding (NVIDIA)

Microsoft Computer Vision Recipes (Github)

Image-Scaling Attacks (Adversarial Machine Learning)

Suppression of Correlated Noise with Similarity-Based Unsupervised Deep Learning

Statistics

Perlin Noise

Sørensen–Dice Coefficient

Variance and Covariance

Correlation, p-value

R-squared

Median Absolute Deviation