Social-Based Optimization of Web Crawling for Online Social Networks
    1.
    发明申请
    Social-Based Optimization of Web Crawling for Online Social Networks 有权
    网络社会网络爬网的社会优化

    公开(公告)号:US20160125082A1

    公开(公告)日:2016-05-05

    申请号:US14533229

    申请日:2014-11-05

    Applicant: FACEBOOK, INC.

    Inventor: Vojin Katic

    CPC classification number: G06F17/30864 G06Q50/01

    Abstract: In one embodiment, a method includes a search engine of an online social network crawling a first webpage of a first web domain, where the first webpage includes links to one or more second webpages, each of which may be within a second web domain, accessing a domain ranking for each second web domain, where for each second web domain the domain ranking may be based on one or more domain-quality signals associated with the second web domain, where the domain-quality signals may include a measure of activations of social plug-ins of the online social network associated with webpages of the second web domain, selecting one or more of the second webpages to crawl based at least in part on the domain ranking of the second web domain associated with the second webpage, and the search engine of the online social network crawling each selected second webpage.

    Abstract translation: 在一个实施例中,一种方法包括在线社交网络的搜索引擎,其爬行第一网域的第一网页,其中第一网页包括指向一个或多个第二网页的链接,每个网页可以在第二网域内,访问 每个第二网域的域排名,其中对于每个第二网域,域排名可以基于与第二网域相关联的一个或多个域质量信号,其中域质量信号可以包括社会活动的度量 所述在线社交网络的插件与所述第二网域的网页相关联,至少部分地基于与所述第二网页相关联的所述第二网域的域排名选择所述第二网页中的一个或多个以进行爬网,以及所述搜索 在线社交网络的引擎抓取每个选定的第二个网页。

    Ranking external content on online social networks

    公开(公告)号:US10268763B2

    公开(公告)日:2019-04-23

    申请号:US14341148

    申请日:2014-07-25

    Applicant: Facebook, Inc.

    Inventor: Vojin Katic

    Abstract: In one embodiment, a social-networking system may access an enhanced search index of an online social network. The enhanced search index may include data from a social graph having a plurality of nodes and a plurality of edges connecting the nodes, where the nodes comprise a plurality of internal nodes corresponding to entities associated with the online social network, and a plurality of external nodes corresponding to objects associated with a third-party system. The social-networking system may then search the enhanced search index in response to a query received from a user to identify objects that substantially match the query. Each identified object may be scored by the social-networking system based at least in part on a connectivity of the corresponding external node to the one or more internal nodes. In response to the query, the social-networking system may send a search-results page referencing objects based on their scores.

    Generating preview data for online content
    3.
    发明授权
    Generating preview data for online content 有权
    生成在线内容的预览数据

    公开(公告)号:US09442903B2

    公开(公告)日:2016-09-13

    申请号:US14174627

    申请日:2014-02-06

    Applicant: FACEBOOK, INC.

    Inventor: Vojin Katic

    Abstract: Social networking systems benefit from techniques that improve the ability of users to share online content with other users of a social networking system. In one embodiment, when a user types, pastes, or otherwise inserts a URL, or some other hyperlink, into a message or post to the social networking system, a set of data on the referenced hyperlink target is acquired and stored on a server of the social networking system. The stored data is analyzed, to automatically generate a preview for the hyperlink; and the hyperlink preview is transmitted to the client device for approval. In one embodiment, follow-up actions related to the content are performed when the content is posed, which enables users to perform social graph actions to user nodes and concept nodes related to the message or post. In one embodiment, the shared content is cached on the social networking system.

    Abstract translation: 社交网络系统受益于提高用户与社交网络系统的其他用户共享在线内容的能力的技术。 在一个实施例中,当用户键入,粘贴或以其他方式将URL或其他一些超链接插入消息或者发布到社交网络系统时,获取所引用的超链接目标上的一组数据并将其存储在 社交网络系统。 分析存储的数据,自动生成超链接的预览; 并将超链接预览传送到客户端设备进行审批。 在一个实施例中,当提出内容时执行与内容相关的后续动作,这使得用户能够对与消息或帖子相关的用户节点和概念节点执行社交图形动作。 在一个实施例中,共享内容被缓存在社交网络系统上。

    Generating preview data for online content

    公开(公告)号:US10133710B2

    公开(公告)日:2018-11-20

    申请号:US14174676

    申请日:2014-02-06

    Applicant: FACEBOOK, INC.

    Inventor: Vojin Katic

    Abstract: Social networking systems benefit from techniques that improve the ability of users to share online content with other users of a social networking system. In one embodiment, when a user types, pastes, or otherwise inserts a URL, or some other hyperlink, into a message or post to the social networking system, a set of data on the referenced hyperlink target is acquired and stored on a server of the social networking system. The stored data is analyzed, to automatically generate a preview for the hyperlink; and the hyperlink preview is transmitted to the client device for approval. In one embodiment, follow-up actions related to the content are performed when the content is posed, which enables users to perform social graph actions to user nodes and concept nodes related to the message or post. In one embodiment, the shared content is cached on the social networking system.

    Maintaining cached data extracted from a linked resource

    公开(公告)号:US09832284B2

    公开(公告)日:2017-11-28

    申请号:US14141678

    申请日:2013-12-27

    Applicant: Facebook, Inc.

    CPC classification number: H04L67/42 G06F17/30902 H04L67/2842

    Abstract: Exemplary methods, apparatuses, and systems include a network service receiving a request including a hyperlink. The network service acquires data from a resource referenced by the hyperlink. The network service stores the acquired data within a network service cache and sets a refresh interval. The network service utilizes the stored data to respond to additional requests including the hyperlink received during the refresh interval. The network service reacquires data from the resource after the expiration of the refresh interval. The refresh interval is updated by increasing or decreasing a frequency of the refresh interval in response to an amount of change to data associated with the resource over time.

    Social-based optimization of web crawling for online social networks

    公开(公告)号:US09703870B2

    公开(公告)日:2017-07-11

    申请号:US14533229

    申请日:2014-11-05

    Applicant: Facebook, Inc.

    Inventor: Vojin Katic

    CPC classification number: G06F17/30864 G06Q50/01

    Abstract: In one embodiment, a method includes a search engine of an online social network crawling a first webpage of a first web domain, where the first webpage includes links to one or more second webpages, each of which may be within a second web domain, accessing a domain ranking for each second web domain, where for each second web domain the domain ranking may be based on one or more domain-quality signals associated with the second web domain, where the domain-quality signals may include a measure of activations of social plug-ins of the online social network associated with webpages of the second web domain, selecting one or more of the second webpages to crawl based at least in part on the domain ranking of the second web domain associated with the second webpage, and the search engine of the online social network crawling each selected second webpage.

    Ranking External Content on Online Social Networks
    7.
    发明申请
    Ranking External Content on Online Social Networks 审中-公开
    排名在线社交网络的外部内容

    公开(公告)号:US20160026713A1

    公开(公告)日:2016-01-28

    申请号:US14341148

    申请日:2014-07-25

    Applicant: Facebook, Inc.

    Inventor: Vojin Katic

    Abstract: In one embodiment, a social-networking system may access an enhanced search index of an online social network. The enhanced search index may include data from a social graph having a plurality of nodes and a plurality of edges connecting the nodes, where the nodes comprise a plurality of internal nodes corresponding to entities associated with the online social network, and a plurality of external nodes corresponding to objects associated with a third-party system. The social-networking system may then search the enhanced search index in response to a query received from a user to identify objects that substantially match the query. Each identified object may be scored by the social-networking system based at least in part on a connectivity of the corresponding external node to the one or more internal nodes. In response to the query, the social-networking system may send a search-results page referencing objects based on their scores.

    Abstract translation: 在一个实施例中,社交网络系统可以访问在线社交网络的增强的搜索索引。 增强搜索索引可以包括具有连接节点的多个节点和多个边缘的社交图表的数据,其中节点包括对应于与在线社交网络相关联的实体的多个内部节点,以及多个外部节点 对应于与第三方系统相关联的对象。 然后,社交网络系统可以响应于从用户接收的查询来搜索增强搜索索引,以识别与查询基本匹配的对象。 至少部分地基于相应的外部节点与一个或多个内部节点的连接性,社交网络系统可以对每个识别的对象进行评分。 响应于该查询,社交网络系统可以基于他们的分数发送引用对象的搜索结果页面。

    MAINTAINING CACHED DATA EXTRACTED FROM A LINKED RESOURCE
    8.
    发明申请
    MAINTAINING CACHED DATA EXTRACTED FROM A LINKED RESOURCE 有权
    维护从链接的资源中提取的缓存数据

    公开(公告)号:US20150186390A1

    公开(公告)日:2015-07-02

    申请号:US14141678

    申请日:2013-12-27

    Applicant: Facebook, Inc.

    CPC classification number: H04L67/42 G06F17/30902 H04L67/2842

    Abstract: Exemplary methods, apparatuses, and systems include a network service receiving a request including a hyperlink. The network service acquires data from a resource referenced by the hyperlink. The network service stores the acquired data within a network service cache and sets a refresh interval. The network service utilizes the stored data to respond to additional requests including the hyperlink received during the refresh interval. The network service reacquires data from the resource after the expiration of the refresh interval. The refresh interval is updated by increasing or decreasing a frequency of the refresh interval in response to an amount of change to data associated with the resource over time.

    Abstract translation: 示例性方法,装置和系统包括接收包括超链接的请求的网络服务。 网络服务从超链接引用的资源获取数据。 网络服务将获取的数据存储在网络服务高速缓存内并设置刷新间隔。 网络服务利用所存储的数据来响应包括在刷新间隔期间接收的超链接的附加请求。 网络服务在刷新间隔到期后重新获取资源中的数据。 响应于与资源相关联的数据随时间的变化量,增加或减少刷新间隔的频率来更新刷新间隔。

    Social-based optimization of web crawling for online social networks

    公开(公告)号:US10719564B2

    公开(公告)日:2020-07-21

    申请号:US15610226

    申请日:2017-05-31

    Applicant: Facebook, Inc.

    Inventor: Vojin Katic

    Abstract: In one embodiment, a method includes identifying, by a search engine of an online social network, web domains external to the online social network. The method includes accessing domain-quality signals associated with each web domain. At least one of the domain-quality signals includes a measure of activations of social plug-ins of the online social network available on webpages of each web domain, a social plug-in being an executable script providing an activable user-interface element for interacting with the online social network from the webpage. The method includes calculating, for each web domain, a domain ranking based at least in part on the domain-quality signals associated with the web domain. The method includes identifying, by the search engine, some of the web domains as low-quality web domains to avoid accessing based at least in part on the domain rankings of the web domains not satisfying a threshold domain ranking.

    Social-Based Optimization of Web Crawling for Online Social Networks

    公开(公告)号:US20170270206A1

    公开(公告)日:2017-09-21

    申请号:US15610226

    申请日:2017-05-31

    Applicant: Facebook, Inc.

    Inventor: Vojin Katic

    Abstract: In one embodiment, a method includes identifying, by a search engine of an online social network, web domains external to the online social network. The method includes accessing domain-quality signals associated with each web domain. At least one of the domain-quality signals includes a measure of activations of social plug-ins of the online social network available on webpages of each web domain, a social plug-in being an executable script providing an activable user-interface element for interacting with the online social network from the webpage. The method includes calculating, for each web domain, a domain ranking based at least in part on the domain-quality signals associated with the web domain. The method includes identifying, by the search engine, some of the web domains as low-quality web domains to avoid accessing based at least in part on the domain rankings of the web domains not satisfying a threshold domain ranking.

Patent Agency Ranking