Abstract:
A system and method of detecting separator lines in a web page may include determining coordinates of visible web elements on a web page, generating an edge image of the web page based on the coordinates of the web elements, filtering edges belonging to non-separator line elements within the edge image, detecting horizontal lines within the edge image, detecting vertical lines within the edge image, and filtering short lines within the edge image. A system for detecting separator lines in a web page may include a memory device, and a processor communicatively coupled to the memory, in which the processor determines coordinates of visible web elements on a web page, generates an edge image of the web page based on the coordinates of the web elements, filters edges belonging to non-separator line elements within the edge image, detects horizontal lines within the edge image, detects vertical lines within the edge image, and filters short lines within the edge image.
Abstract:
An exemplary embodiment of the present may generate a DOM-tree and generate a signal based on the DOM-tree and a node list. The signal may be analyzed and nodes may be selected within the signal to form a periodic wave. Repeat patterns may be detected using the periodic wave and the nodes.
Abstract:
A method for detection of visual separators in web pages using code analysis includes receiving a web page and its associated web code by a web page analysis device and analyzing the web code to detect visual separators in the web page. A web page analysis device for visual separator detection in web pages is also provided.
Abstract:
Presented is a method and system for sending a message using a mobile phone. The method includes composing a message for sending to a recipient, generating a contact-label having contact information of the recipient, combining the message and the contact-label, capturing an image of the message and the contact-label combination for sending to the recipient using the mobile phone, decoding the captured image for identifying the contact information of the recipient, and sending the message to identified contact information.
Abstract:
Examples of the present disclosure may include methods, systems, and computer readable media with executable instructions. An example method for determining document structure similarity can include segmenting path sequences (206) of Document Object Model (DOM) trees (120, 462) from a number of web pages (202) into B components (561). Path signals (210) corresponding to the path sequences (206) are determined based on a count of the occurrences of particular paths in the Bthe component (571), and unique path signals (210) are transformed into discrete wavelet signals (214)(572). The discrete wavelet signals (214) are analyzed at multiple DOM tree resolution levels (573).
Abstract:
An example method includes dividing the image into blocks of pixels of a predefined size; for each block, selecting one of a plurality of saturation patterns representing a binary value of one or more bits of digital watermark data corresponding to the block; and for each block, embedding the binary value of the one or more bits of corresponding digital watermark data into the block by adjusting the saturation of pixels within the block in accordance with the selected pattern.
Abstract:
Facilitating information retrieval and improving similarity score computations among objects, via a compactor component that interacts with a layered memory structure. Data structures (e.g., tuples) that are associated with objects can be compacted into a condensed format, via employing a layered memory structure. The system further includes a sorting component that is operatively coupled with the “compactor component”, to reduce memory space that is required to store and retrieve similarity scores related to various objects.
Abstract:
Disclosed is a method of blending stitched document image portions. The method identifies background pixels and foreground pixels on each boundary of the image portions. Pixels of the image portions are then modified based on a pixel value difference between corresponding background pixels on the respective boundary of the first and second portions.
Abstract:
A system and method of selecting content within a web page (110, 300) may include, with a processor (125), determining spatial coordinates of a plurality of nodes (210 through 285) within the web page (110, 300), recording coordinates of a drawn portion (610) of the web page (110, 300), and determining, with the processor (125), a number of corresponding regions (710, 910) for the drawn portion (610) of the web page (110, 300) based on the spatial coordinates of the nodes (210 through 285).
Abstract:
A system and method for selecting main content (350) from web pages includes receiving a web page (205) by a web page analysis device (105) and scoring sub-trees (209) within the web page (205). The single sub-tree (225) with the highest final score is selected as the main content (350) of the webpage (205).