Locality-Sensitive Hashing (LSH): Papers and Code from the Original Authors
LSH Algorithm and Implementation (E2LSH)
Locality-Sensitive Hashing (LSH) is an algorithm for solving the approximate or exact Near Neighbor Search problem in high-dimensional spaces. This webpage links to the newest LSH algorithms in Euclidean and Hamming spaces, as well as the E2LSH package, an implementation of an early practical LSH algorithm.
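To make the idea concrete, here is a minimal sketch of the classic data-independent approach for Hamming space, usually called bit sampling: each hash function reads a few randomly chosen coordinates, so points that differ in few coordinates are likely to share a bucket. This is an illustration only, not code from the E2LSH package; the function names (`make_bit_sampler`, `build_index`, `query`) and the parameters `k` and `num_tables` are assumptions chosen for the example.

```python
import random
from collections import defaultdict


def make_bit_sampler(dim, k, rng):
    """Return g(x): the tuple of k randomly chosen coordinates of x."""
    positions = [rng.randrange(dim) for _ in range(k)]

    def g(x):
        return tuple(x[i] for i in positions)

    return g


def build_index(points, dim, k, num_tables, seed=0):
    """Hash every point into num_tables independent hash tables."""
    rng = random.Random(seed)
    tables = []
    for _ in range(num_tables):
        g = make_bit_sampler(dim, k, rng)
        buckets = defaultdict(list)
        for idx, p in enumerate(points):
            buckets[g(p)].append(idx)
        tables.append((g, buckets))
    return tables


def hamming(x, y):
    """Number of coordinates in which x and y differ."""
    return sum(a != b for a, b in zip(x, y))


def query(tables, points, q, radius):
    """Return indices of colliding points that lie within `radius` of q."""
    seen = set()
    result = []
    for g, buckets in tables:
        for idx in buckets.get(g(q), []):
            if idx not in seen:
                seen.add(idx)
                # Verify each candidate with the true distance.
                if hamming(points[idx], q) <= radius:
                    result.append(idx)
    return result
```

Because the hash functions are drawn without looking at the data, inserting a new point only requires appending it to one bucket per table. The query may miss a true near neighbor that collides in none of the tables; raising `num_tables` lowers that risk at the cost of memory.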
- Algorithm description:
- Newest (not quite) LSH algorithms (2014): These algorithms achieve better performance than the classic LSH algorithms by using data-dependent hashing. They improve over the classic LSH algorithms for both Hamming
and Euclidean space. These algorithms are not dynamic, however, in contrast to the classic LSH algorithms, which use data-independent hashing and hence allow updates to the point set.
Optimal Data-Dependent Hashing for Approximate Near Neighbors (by Alexandr Andoni and Ilya Razenshteyn). In STOC'15 (to appear). Full version in arXiv:1501.01062.
Beyond Locality Sensitive Hashing (by Alexandr Andoni, Piotr Indyk, Huy L. Nguyen, and Ilya Razenshteyn). In SODA'14.
Slides: Here are some slides by Alexandr Andoni on the early version from SODA'14.
- Survey of LSH in CACM (2008):
"Near-Optimal Hashing Algorithms for Approximate Nearest Neighbor in High Dimensions" (by Alexandr Andoni and Piotr Indyk). Communications of the ACM, vol. 51, no. 1, 2008,
pp. 117-122. (CACM disclaimer).
Also available directly from CACM (for free).
- Not-so-recent algorithm for Euclidean space (2006): "Near-Optimal Hashing Algorithms for Near Neighbor Problem in High Dimensions" (by Alexandr Andoni and Piotr Indyk). In FOCS'06.
Slides on this LSH algorithm from a talk given by Piotr Indyk.
- Earlier algorithm for Euclidean space (2006): a good introduction to LSH, and a description of the state of affairs as of 2006, is in the following book chapter:
Locality-Sensitive Hashing Scheme Based on p-Stable Distributions (by Alexandr Andoni, Mayur Datar, Nicole Immorlica, Piotr Indyk, and Vahab Mirrokni), appearing in the book Nearest Neighbor Methods in Learning and Vision: Theory and Practice, by T. Darrell and P. Indyk and G. Shakhnarovich (eds.), MIT Press, 2006.
See also the book introduction for a smooth introduction to the NN problem and LSH.
- Original LSH algorithm (1999): the previous version of the algorithm, for the Hamming distance, is described in the [GIM'99] paper.
- Implementation of LSH: download the E2LSH package (alpha version). The code is based on the algorithm described in the
book chapter (2006) from above. You can also download the manual for the code. The code was developed by Alex Andoni
in 2004-2005.
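The book chapter cited above describes a hashing scheme based on p-stable distributions. As a rough illustration of that scheme for Euclidean distance, the sketch below uses hash functions of the form h(v) = floor((a·v + b) / w), where a has independent Gaussian entries (the Gaussian is 2-stable) and b is uniform in [0, w). It is not taken from the E2LSH source: the helper names `make_pstable_hash` and `make_compound_hash` are hypothetical, and the values of `w` and `k` in the usage lines are arbitrary.

```python
import math
import random


def make_pstable_hash(dim, w, rng):
    """One hash function h(v) = floor((a . v + b) / w)."""
    a = [rng.gauss(0.0, 1.0) for _ in range(dim)]  # Gaussian entries (2-stable)
    b = rng.random() * w                           # uniform offset in [0, w)

    def h(v):
        dot = sum(ai * vi for ai, vi in zip(a, v))
        return math.floor((dot + b) / w)

    return h


def make_compound_hash(dim, k, w, rng):
    """Concatenate k independent hash functions into one bucket key g(v)."""
    hs = [make_pstable_hash(dim, w, rng) for _ in range(k)]

    def g(v):
        return tuple(h(v) for h in hs)

    return g


# Usage with arbitrary illustrative parameters.
rng = random.Random(0)
g = make_compound_hash(dim=4, k=5, w=4.0, rng=rng)
bucket_key = g([0.5, -1.0, 2.0, 0.25])
```

Points that are close in Euclidean distance project to nearby values of a·v and so tend to fall into the same width-w interval. Concatenating k functions makes collisions between far-apart points less likely. In a full index, several such compound functions would each key a separate hash table.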
This research was supported in part by NSF CAREER Grant #0133849 "Approximate Algorithms for High-dimensional Geometric Problems".
from: http://web.mit.edu/andoni/www/LSH/index.html