A METHOD AND APPARATUS FOR TABLE RECOGNITION

    公开(公告)号:US20250166405A1

    公开(公告)日:2025-05-22

    申请号:US18727310

    申请日:2022-12-13

    Abstract: This application discloses a method for table recognition which can acquire an image to be processed that includes a table, and determine information about individual table cells in the image to be processed, the information about the individual table cells comprising positions of bounding boxes of the individual table cells. And then, parent table cells of the individual table cells in a row direction and parent table cells of the individual table cells in a column direction are obtained based on the information about the individual table cells. Further, structural coordinates of the individual table cells can be obtained based on parent-child relationships of the individual table cells in the row direction and parent-child relationships of the individual table cells in the column direction, wherein a structural coordinate comprises a starting row, a starting column, a terminating row, and a terminating column.

    METHOD, APPARATUS, READABLE MEDIUM AND ELECTRONIC DEVICE OF KEY-VALUE MATCHING

    公开(公告)号:US20250029362A1

    公开(公告)日:2025-01-23

    申请号:US18714402

    申请日:2022-11-01

    Abstract: A method, apparatus, readable medium and electronic device of key-value matching, the method inputs the image to be detected into a predetermined key-value matching model, to cause the predetermined key-value matching model to output a matching relationship between the attribute data and the attribute value data, in this way, it can not only provide an end-to-end network model for key-value matching, effectively improve the efficiency of key-value matching, but also obtain the target attribute value data region and the target attribute data region of higher accuracy by the semantic segmentation submodel in the predetermined key-value matching model, and then determine the matching relationship between the attribute data and the attribute value data in the image to be detected based on the target attribute data region and the target attribute value data region by the image matching submodel, thereby effectively improving the accuracy of the key-value matching result.

    IMAGE RESTORATION METHOD AND APPARATUS, DEVICE, MEDIUM AND PRODUCT

    公开(公告)号:US20250131535A1

    公开(公告)日:2025-04-24

    申请号:US18834506

    申请日:2023-02-23

    Abstract: The present disclosure provides an image restoration method and apparatus, a device, a medium and a product. The method includes: acquiring an image to be restored; and then, inputting the image to be restored into a structure restoration model, obtaining a first feature sequence and a second feature sequence by down-sampling the image to be restored based on a plurality of branches of the structure restoration model, converting the first feature sequence into a third feature sequence that has the same length as the second feature sequence, fusing the third feature sequence with the second feature sequence, and obtaining an image in which the structure of the image to be restored is restored by performing structure restoration on the image to be restored according to a fused feature sequence. In this way, a restored image with higher restoration precision and a better effect can be obtained.

    METHOD, APPARATUS, READABLE STORAGE MEDIUM, AND ELECTRONIC DEVICE FOR OBJECT ATTRIBUTE RECOGNITION

    公开(公告)号:US20250095327A1

    公开(公告)日:2025-03-20

    申请号:US18730536

    申请日:2022-12-26

    Abstract: The disclosure relates to a method, apparatus, readable storage medium, and electronic device for object attribute recognition. The method includes: acquiring a target image, the target image comprising a target object and object description information of the target object; extracting, from the target image, a sequence of key information features of the target object and a sequence of multimodal features corresponding to a target attribute of the target object, the sequence of multimodal features comprising a sequence of visual features and a sequence of semantic features of the target attribute; and determining a plurality of object attributes of the target object based on the sequence of key information features and the sequence of multimodal features.

    IMAGE DESCRIPTION GENERATION METHOD AND APPARATUS, DEVICE, MEDIUM, AND PRODUCT

    公开(公告)号:US20250104453A1

    公开(公告)日:2025-03-27

    申请号:US18832018

    申请日:2023-02-27

    Abstract: The present disclosure provides an image description generation method and apparatus, a device, a medium, and a product, and relates to the technical field of image processing. The method includes obtaining an image including a target object; respectively extracting a label feature of the target object, a position feature of the target object in the image, a text feature in the image, and a visual feature of the target object from the image; and generating a natural language description for the image according to the label feature, the position feature, the text feature, the visual feature, and a visual linguistic model. It is apparent that through the method, more effective information is extracted from the image, such that the model can better understand the image, thereby improving a matching degree between the obtained natural language description and the target object in the image.

Patent Agency Ranking