Feature-based deduplication of metadata for places
摘要:
The technology disclosed relates to deduplicating metadata about places. A feature generator module is configured to generate features for metadata profiles. The metadata profiles represent a plurality of places. The features are based on geohash strings and word embeddings generated for the metadata profiles. A diff generator module is configured to generate diff vectors that pair-wise encode results of comparison between features of paired metadata profiles. A classification module is configured to generate similarity scores for the paired metadata profiles based on the diff vectors. A particular similarity score indicates whether metadata profiles in a particular pair of metadata profiles represent a same place.
公开/授权文献
信息查询
0/0