Similarity measure design on big data

Sanghyuk Lee*, Yan Sun

*Corresponding author for this work

Research output: Chapter in Book or Report/Conference proceedingConference Proceedingpeer-review

Abstract

Clustering algorithm in big data was designed, and its idea was based on defining similarity measure. Traditional similarity measure on overlapped data was illustrated, and application to non-overlapped data was carried out. Similarity measure on high dimension data was obtained through getting information from neighbor data. Its usefulness was proved, and verified by calculation of similarity for artificial data example.

Original languageEnglish
Title of host publicationFuture Information Communication Technology and Applications, ICFICE 2013
Pages811-819
Number of pages9
DOIs
Publication statusPublished - 2013
Event2013 International Conference on Future Information and Communication Engineering, ICFICE 2013 - Shenyang, China
Duration: 24 Jun 201326 Jun 2013

Publication series

NameLecture Notes in Electrical Engineering
Volume235 LNEE
ISSN (Print)1876-1100
ISSN (Electronic)1876-1119

Conference

Conference2013 International Conference on Future Information and Communication Engineering, ICFICE 2013
Country/TerritoryChina
CityShenyang
Period24/06/1326/06/13

Keywords

  • Big data
  • High dimension data
  • Neighbor information
  • Similarity measure

Fingerprint

Dive into the research topics of 'Similarity measure design on big data'. Together they form a unique fingerprint.

Cite this