Abstract:
Disclosed herein is a method, apparatus, and computer program for providing sub-content while providing online content. The method, apparatus, and computer program for providing sub-content while providing online content provides predetermined sub-content before providing online main content to a user, displays a predetermined user interface after a certain period of time, and continuously provides the sub-content only when the user selects the user interface.
Abstract:
Disclosed herein is a method, apparatus, and computer program for providing sub-content while providing online content. The method, apparatus, and computer program for providing sub-content while providing online content provides predetermined sub-content before providing online main content to a user, displays a predetermined user interface after a certain period of time, and continuously provides the sub-content only when the user selects the user interface.
Abstract:
Disclosed is a method and system, the method including extracting similar and dissimilar document pair sets from a document database, the similar document pair set including similar document pairs having a common attribute, and the dissimilar document pair set including dissimilar document pairs extracted randomly, calculating a mathematical similarity for each of the similar and dissimilar document pairs using a mathematical measure to obtain a first and second mathematical similarities, calculating a semantic similarity for each of the similar and dissimilar document pairs to obtain a first and second semantic similarities, the first semantic similarities being higher than the first mathematical similarities, and the second semantic similarities being lower than the second mathematical similarities, training a similarity model based on the similar and dissimilar document pairs, and the first and second semantic similarities to obtain a trained similarity model, and detecting a duplicate document using the trained similarity model.
Abstract:
Disclosed is a method and system for detecting a duplicate document using vector quantization. A duplicate document detection method may include acquiring, by processing circuitry, a respective vector expression for each of a plurality of documents using a similarity model, the similarity model being trained to output similar vector expressions for semantically similar documents, generating a key by performing a vector quantization on the respective vector expression, the key including a binary character string, and detecting a duplicate document from among the plurality of documents using the key.