Are large 官方网投 agencies, with a wide scope of international coverage on multiple topics, with large numbers of reporters, and finely edited 文章 s better sources of 官方网投 than smaller and more local papers, or narrow niche blogs?
A patent on ranking 文章 s 在谷歌 News was granted this week that was originally filed in 2003, and it discusses several ranking factors that it might use to present 官方网投 文章 s based upon the “quality” of the 官方网投 sources involved.
专利没有’t include a full range of signals that Google probably considers in ranking 官方网投 stories, such as the freshness of the 官方网投 (as 注意到的 在谷歌’的专利申请在Universal Search上），或者某个来源是否为原始来源。
顺便说一句，关于从Google研究人员的官方网投文章，博客文章或网页中查找内容的实时或非常实时的来源的主题是一篇技术性强但有趣的论文， 有效地检测文本段的来源 （pdf）。
The premise behind developing 质量 signals for 官方网投 文章 s is established early on in the patent:
For example, suppose a person wishes to obtain the latest 官方网投 regarding a particular topic via the Internet. The person accesses a web site that includes a conventional search 发动机。 The person enters one or more terms relating to the topic of interest, such as “Iraq,” into the search engine to attempt to locate a 官方网投 source that has published an 文章 relating to the topic.
Using a search engine in this manner to locate individual web sites that provide 官方网投 文章 s relating to the desired topic often results in a ranked list of hundreds or even thousands of “hits,”其中每个匹配可能对应于与搜索字词相关的网页。
While each of the hits in the ranked list may relate to the desired topic, the 官方网投 sources associated with these 点击 however, may not be of uniform 质量.
For example, CNN and BBC are widely regarded as high-quality sources of accuracy of reporting, professionalism in writing, etc., while local 官方网投 sources, such as hometown 官方网投 sources, may be of lower 质量.
那里fore, there exists a need for systems and methods for improving the ranking of 官方网投 文章 s based on the 质量 of the 官方网投 source with which the 文章 s are associated.
I’m questioning that assumption, that sources such as CNN or BBC, maybe better sources of 质量 信息 than hometown 官方网投 sources in many instances. I think it’s often possible that a local reporter and a local hometown 官方网投 source may hold the potential to provide details and insights and 信息 that a larger organization may miss. It is worth looking at the signals that are listed in the patent, though.
Systems and methods for improving the ranking of 官方网投 文章 s
由Michael Curtiss，Krishna Bharat和Michael Schmitt发明
A system ranks results. The system may receive a list of links. The system may identify a source with which each of the links is associated and rank the list of links based at least in part on a 质量 of the identified sources.
The process of coming up with a source rank score for a 官方网投 source is based upon looking at several metrics for each 官方网投 source, which measure different attributes of the source.
Number of 文章 s produced 通过 the 官方网投 source during a given time period
据推测，在一段时间内，源产生的文章（非重复文章）越多越好。我们’re told that as an alternative, the search engine might consider the number of original sentences published 通过 the 官方网投 source during that time.
Average length of an 文章 from the 官方网投 source
较长的文章更好吗？如果搜索引擎要看CNN’s top 100 官方网投 stories from the past week, and the top 100 官方网投 stories from another source, and compare the length of those, should the source with the longest 文章 s
be considered higher 质量? If the search engine instead clustered together all 文章 s on a specific story and looked at the length of those, would the longest again be the higher 质量 story? This metric appears to indicate that it is a signal to consider.
Breaking 官方网投 score
How soon after an important event happens does the 官方网投 source publish a story about it? If all of the stories about that event were clustered together, and the publication dates and times were viewed, the sources that responded quickest would have a higher “breaking 官方网投 score.”
If the search engine were to track how many people followed links to particular 官方网投 sources when they were presented with links to those sources, which sources did people tend to visit more? This doesn’t measure the “popularity” of 官方网投 sources as much as it does whether or not people follow links to particular sources when they see those links in search results.
Human opinion of the 官方网投 source
People WHO use the search engine may be polled to identify 官方网投 sources that they enjoy reading or have visited. Other measures may also be used as well. For instance, we are told that 官方网投papers can be compared based at least in part on the number of Pulitzer prizes the papers have won. We’re also told that the age of a 官方网投 source “可能被视为公众的信心衡量标准。”作为另一种选择，可以向评估人员显示来自不同来源的文章的选择，并要求评估者为其来源分配得分。
Circulation statistics of the 官方网投 source
与来源相关的印刷出版物的发行量统计，代理商使用情况统计“例如Media Metrix和Nielsen Netratings，”以及其他可能的方法来衡量到源的流量。
The size of the staff associated with the 官方网投 source
The numbers of distinct journalist names from 文章 s in the 官方网投 source might be viewed.
The number of 官方网投 bureaus associated with the 官方网投 source
This seems to favor larger and more established 官方网投 agencies.
Original named entities appearing in 文章 s produced 通过 the 官方网投 source
如果有关某个特定事件的所有故事都聚集在一起，并且其中包含提及命名实体的提及，而同一主题的其他文章则没有’t include, it might rank higher than others. This metric is supposed to indicate that 官方网投 sources are “能够进行原始报告。”使用此方法有一些限制。例如，可以考虑文章的发布日期以查看哪个文章何时包含哪个命名实体。在确定文章中的命名实体是否唯一时，还可以查看拼写和缩写的变化。
Articles from 官方网投 sources might be categorized into different topics, and the range of those topics might be considered as an indication of the breadth of that source. This seems to favor more general sources than ones focused upon a narrower niche. A more focused source may have higher 质量 文章 s about the topics that they specialize in.
International diversity of the 官方网投 source
This looks at the number of countries from which the 官方网投 site receives traffic on the Web. The search engine might look at something like the IP addresses of people WHO click through links to the sources, to see how to spread out their audience might be across the globe.
The writing style used 通过 the 官方网投 source
The search engine might use automated tests to measure spelling, grammar, and reading levels for a 官方网投 source.
Other signals might also be considered, such as the number of links that might be seen pointing to the 官方网投 web site.
While this was filed almost 6 years ago, it does provide details for an algorithmic approach to assigning scores for 官方网投 sources that could be used to rank 官方网投 文章 s 在谷歌 News, and many of the assumptions behind specific factors in that algorithm. It’可能今天仍在使用该算法的某些版本，并且可能还会使用许多所涉及的排名因素。
For instance, if a breaking story came out about a discovery in Physics, and a reputable and well-respected site on Physics News published an insightful and detailed 文章 on the discovery, it could be a better source for the topic than a 官方网投 source which may have written about the discovery first, has many more reporters and much wider circulation, gets seen 通过 a much more international audience, has a wide number of 官方网投 bureaus, has been publishing since the 1800s, and was written 通过 someone WHO doesn’根本不了解物理学。