Which 谷歌 Link Analysis Approach May Have 更改d?
In the 谷歌 Inside Search blog, 谷歌’s Amit Singhal发表了一篇标题为 搜索质量亮点：2月有40处更改 that told us 关于 many changes to how 谷歌 ranks pages, including the following:
Link evaluation. We often use 链接的特征 to help us figure out the topic of a linked page. We have changed how we evaluate links; in particular, we are turning off a 链接分析方法 that we used for several years. We often rearchitect or 关掉 parts of our scoring to keep our system maintainable, clean and understandable.
Curious 关于 which link analysis method 谷歌 may have stopped using, I decided to look at different link analysis methods I have seen that 谷歌 has used in the past, to try to identify a link analysis approach that they may have stopped using. I couldn’决定他们可能停止了哪一种，但是将所有这些链接分析方法都放在一个地方很有趣。
A lot of people were guessing which 链接分析方法 might have been changed, from PageRank being 关掉, to anchor text being devalued, to 谷歌 ignoring rel=”nofollow” attributes in links, to others. I was asked my opinion 通过 a few people and mentioned that there were several potential link analysis approaches that 谷歌 might have stopped using.
I’ve made a list of a dozen possibilities and granted 谷歌 patents that describe them, but 谷歌 uses link analysis in a lot of ways, and what 谷歌 关掉 might involve something else entirely, and/or something that might not even be described in a patent.
In Plex一书中提到该专利的发明人Krishna Bharat开发了一种类似于 HITS算法 that was incorporated into what 谷歌 does in 2003. This patent was granted in 2003, and it’在很多方面与HITS算法相似。
This process might be somewhat unnecessary these days, especially if 谷歌 is reranking search results based on something like the co-occurrence of terms in a result set based upon 基于短语的索引. – Ranking search results 通过 reranking the results based on 本地 inter-connectivity
A paper 关于 this type of link analysis method, written 通过 a couple of researchers who would end up becoming 谷歌 Employees is 在万维网上查找相关页面
Could 谷歌 have found a better way f finding related pages? It’可能，但显示唐的页面’似乎没有改变。– 使用基于链接的分析查找相关超链接文档的技术
谷歌 has a lot more pages indexed now than they did when the patent behind this approach was filed, and they may still need this shortcut. They’也有技术上的进步，也许没有。
那里 is a whitepaper that was written 通过 the inventors of this link analysis approach, intended to speed up how PageRank worked and make 排行 at 谷歌 faster. The paper is PageRank计算的自适应方法，由Sepandar Kamvar，Taher Haveliwala和Gene Golub撰写
新增26，2019 –最近有人告诉我们 Former 谷歌 Engineer: 谷歌 Hasn’t自2006年以来使用的PageRank。他说Google于2006年停止使用原始的PageRank，并用名称非常相似但可以更快，更有效地替换它。我猜想自适应PageRank（据估计可以为页面计算PageRank分数的速度快30％）是根据此消息最有可能取代PageRank的链接分析方法。 （我们不 ’我不知道PageRank是否是我在本文开头提到的文章中Amit Singhal所引用的链接分析方法，但是可能是。）
It might be possible to use anchor text from a link on a page in one language to understand what webpage that link is pointing to in another language, to understand what the targeted page is 关于.
谷歌 has probably clustered similar web pages 通过 looking at other pages that link to pages appearing in search results, and seeing what other pages they link to.
I wrote 关于 this link analysis method in the post How Link Based Clustering Could Allow 谷歌 to Group Search Results
谷歌 might have replaced this clustering approach with one that focuses instead more upon the content and/or the concepts contained on those pages. – 基于链接的超链接文档集群
谷歌 might use a different approach, such as one that may look at large amounts of data 关于 searchers, pages, and queries to calculate a personalized page score for pages. – 在搜索引擎中个性化锚文本得分
Using anchor text for links to determine the 关联 of the pages they point towards. It’s quite likely that 谷歌 continues to use an approach like this, but in a modified manner that might be influenced 通过 things like 基于短语的索引 – Web搜寻器系统中的锚标签索引
For more details 关于 how this link analysis approach works, I wrote a post 关于 this patent: 谷歌 Patent on Anchor Text Indexing and Crawl Rates.
In 2005, 谷歌 published a patent application that describes a wide range of temporal-based factors related to links, such as the appearance and disappearance of links, the increase, and decrease of backlinks to documents, weights to links based upon 新鲜ness, weights to links based upon authoritativeness of the documents linked from, age of links, spikes in link growth, the relatedness of anchor text to page being pointed to overtime.
谷歌 may have used some of the factors described in this patent and continue to use them or replaced them with something else, and it might have ignored others, – 基于历史数据的信息检索
I’ve撰写了有关该专利的一些文章，以及许多延续专利，这些专利更新了其涵盖的链接分析方法的各个方面。我还找到了该专利的一个较早版本（一个临时版本）并进行了撰写，还有一个延续专利，其重点仅在于原始专利中的某些权利要求。如果该专利引起了您的注意，您可能会发现我的文章有趣。它是在： Revisiting 谷歌’s Information Retrieval Based Upon 历史数据
We’ve known for a few years that 谷歌 will give different weights for links based upon segments of a page where a link is located. It’很有可能今天会继续使用类似的方法，但是可能已经以某种方式进行了修改，例如以某种方式限制了链接的传递量，例如，如果链接出现在页脚上，一个网站的多个页面。
Then again, 谷歌 probably has already been doing that. – 基于视觉间隙的文档分割
谷歌 filed a much more detailed patent focused more upon segmentation of any pages, and not just 本地 pages. This patent can be found at: 确定文档在语义上不同的区域
While both of these patents go beyond link analysis, the location of a link on a page can make a difference regarding how much weight a link might carry. I wrote a more detailed post 关于 the second patent at: 谷歌s Page Segmentation Patent Granted
谷歌’s 合理的冲浪者 model describes a good number of features that might be taken together to determine how much value a link might pass along from a page 关于 other links on that page, and one or more of those values may be no longer considered in a way that they might have been in the past. – 根据用户行为和/或功能数据对文档进行排名
I’ve written a couple of posts 关于 the 合理的冲浪者模型 link analysis approach, because it is an interesting one, and because it was updated at least once. Those posts are:
- 谷歌’s 合理的冲浪者: How the Value of a Link May Differ Based upon Link and Document Features and User Data
- 谷歌’s 合理的冲浪者 Model Updated
I wrote 关于 this patent in much more detail in the post: 谷歌’s Affiliated Page Link Patent
Assigning 关联 of one web page to other web pages could be based upon the distance of clicks between the pages and/or certain features in the content of anchor text or URLs. For example, if one-page links to another with the word “contact” or the word “about”，并且要链接的页面包括一个地址，则该地址位置可能被认为与进行该链接的页面有关。
那里 are a few different parts to this method of having the 关联 of one page on a site propagated to other pages on the same site, and one or more of those could have changed if it is in use. – 在相关网页（例如网站的网页）之间传播有用的信息
I wrote a post 关于 this patent at 谷歌 Determining Search Authority Pages and Propagating Authority to Related Pages
什么“链接分析方法” do you think 谷歌 关掉?