<?xml version="1.0" encoding="utf-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Microsoft Research Hits on Google&#8217;s Page&#160;Rank</title>
	<atom:link href="http://www.searchenginejournal.com/microsoft-research-hits-on-googles-page-rank/7368/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.searchenginejournal.com/microsoft-research-hits-on-googles-page-rank/7368/</link>
	<description></description>
	<lastBuildDate>Tue, 14 Feb 2012 17:43:21 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=</generator>
	<item>
		<title>By: Martin</title>
		<link>http://www.searchenginejournal.com/microsoft-research-hits-on-googles-page-rank/7368/comment-page-1/#comment-1083667</link>
		<dc:creator>Martin</dc:creator>
		<pubDate>Tue, 05 Aug 2008 21:14:44 +0000</pubDate>
		<guid isPermaLink="false">http://www.searchenginejournal.com/microsoft-research-hits-on-googles-page-rank/7368/#comment-1083667</guid>
		<description>@Paul: I don&#039;t think it really matters whether the time some users spend on a page is &quot;quality time&quot;.
Because the score is an average over many users, the result should only be influenced marginally.
However, there should be a limit to the score a page can earn from time spent, since the average time spent differs greatly between kinds of pages (for example, a lexicon with short explanations versus a university site with linked diploma theses). That would, by the way, address both problems.
It is kind of interesting, though, that the concept is pretty much the opposite of the TF-IDF concept.
It is of course a good thing to optimize the static ranking methods, but I think there is a lot more to achieve on the dynamic side.
I would look for better search options, e.g. a topic selection (or search), then performing the actual search and ranking the results against those criteria. I am not talking about semantic search; that probably has a long way to go before it works for arbitrary topics/criteria. Just some heuristics using what we already have.</description>
		<content:encoded><![CDATA[<p>@Paul: I don&#8217;t think it really matters whether the time some users spend on a page is &#8220;quality time&#8221;.<br />
Because the score is an average over many users, the result should only be influenced marginally.<br />
However, there should be a limit to the score a page can earn from time spent, since the average time spent differs greatly between kinds of pages (for example, a lexicon with short explanations versus a university site with linked diploma theses). That would, by the way, address both problems.<br />
It is kind of interesting, though, that the concept is pretty much the opposite of the TF-IDF concept.<br />
It is of course a good thing to optimize the static ranking methods, but I think there is a lot more to achieve on the dynamic side.<br />
I would look for better search options, e.g. a topic selection (or search), then performing the actual search and ranking the results against those criteria. I am not talking about semantic search; that probably has a long way to go before it works for arbitrary topics/criteria. Just some heuristics using what we already have.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Frank McCown</title>
		<link>http://www.searchenginejournal.com/microsoft-research-hits-on-googles-page-rank/7368/comment-page-1/#comment-1083263</link>
		<dc:creator>Frank McCown</dc:creator>
		<pubDate>Tue, 29 Jul 2008 02:41:10 +0000</pubDate>
		<guid isPermaLink="false">http://www.searchenginejournal.com/microsoft-research-hits-on-googles-page-rank/7368/#comment-1083263</guid>
		<description>&quot;Google’s PageRank is not only dependent on links to websites but also on some other signals which its algorithm used to determine the position of websites on search engine results page.&quot;

I think you might be misunderstanding what PageRank is.  The Microsoft researchers are talking about the PageRank algorithm that was introduced by Page and Brin in a research paper. PageRank is *solely* reliant on incoming and outgoing links.  Google uses PageRank as only one factor when determining which pages are relevant to a query.  When two pages that for all purposes are equal in content match a query, it&#039;s the page with the higher PageRank that will appear before the other page in the SERP.

MS would not use just their BrowseRank in a ranking algorithm... they would also use a PageRank-like measure that accounts for the web graph.</description>
		<content:encoded><![CDATA[<p>&#8220;Google’s PageRank is not only dependent on links to websites but also on some other signals which its algorithm used to determine the position of websites on search engine results page.&#8221;</p>
<p>I think you might be misunderstanding what PageRank is.  The Microsoft researchers are talking about the PageRank algorithm that was introduced by Page and Brin in a research paper. PageRank is *solely* reliant on incoming and outgoing links.  Google uses PageRank as only one factor when determining which pages are relevant to a query.  When two pages that for all purposes are equal in content match a query, it&#8217;s the page with the higher PageRank that will appear before the other page in the SERP.</p>
<p>MS would not use just their BrowseRank in a ranking algorithm&#8230; they would also use a PageRank-like measure that accounts for the web graph.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: paul</title>
		<link>http://www.searchenginejournal.com/microsoft-research-hits-on-googles-page-rank/7368/comment-page-1/#comment-1083175</link>
		<dc:creator>paul</dc:creator>
		<pubDate>Mon, 28 Jul 2008 07:11:51 +0000</pubDate>
		<guid isPermaLink="false">http://www.searchenginejournal.com/microsoft-research-hits-on-googles-page-rank/7368/#comment-1083175</guid>
		<description>But how can they distinguish meaningful from meaningless elapsed time on a page? Even if the user leaves their seat at the PC, the browser will keep counting the time...</description>
		<content:encoded><![CDATA[<p>But how can they distinguish meaningful from meaningless elapsed time on a page? Even if the user leaves their seat at the PC, the browser will keep counting the time&#8230;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Liz</title>
		<link>http://www.searchenginejournal.com/microsoft-research-hits-on-googles-page-rank/7368/comment-page-1/#comment-1083115</link>
		<dc:creator>Liz</dc:creator>
		<pubDate>Sat, 26 Jul 2008 16:25:18 +0000</pubDate>
		<guid isPermaLink="false">http://www.searchenginejournal.com/microsoft-research-hits-on-googles-page-rank/7368/#comment-1083115</guid>
		<description>I agree with the comments made by Oded.
There are a number of factors involved.</description>
		<content:encoded><![CDATA[<p>I agree with the comments made by Oded.<br />
There are a number of factors involved.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Oded</title>
		<link>http://www.searchenginejournal.com/microsoft-research-hits-on-googles-page-rank/7368/comment-page-1/#comment-1083114</link>
		<dc:creator>Oded</dc:creator>
		<pubDate>Sat, 26 Jul 2008 16:17:23 +0000</pubDate>
		<guid isPermaLink="false">http://www.searchenginejournal.com/microsoft-research-hits-on-googles-page-rank/7368/#comment-1083114</guid>
		<description>There is no doubt that PR is easily manipulated. However, in today&#039;s SEO-savvy world, I believe people are much more aware of what PR really is.

Basically, PR is just an indicator of the amount/quality of inbound links, and it is only one indicator among many. PR by itself is meaningless. I mean, you cannot compare a PR6 site and a PR3 site and claim &quot;ah, the PR6 site is much better&quot;. Get a lot of high-PR links and you&#039;ll get high PR... but that&#039;s about it. It doesn&#039;t indicate any ranking in the SERP. You can have a PR6 homepage with no ranking while that PR3 homepage has ranking. Plus, considering live.com&#039;s performance... well, I wouldn&#039;t consider Microsoft too much of an authority regarding search technologies :)</description>
		<content:encoded><![CDATA[<p>There is no doubt that PR is easily manipulated. However, in today&#8217;s SEO-savvy world, I believe people are much more aware of what PR really is.</p>
<p>Basically, PR is just an indicator of the amount/quality of inbound links, and it is only one indicator among many. PR by itself is meaningless. I mean, you cannot compare a PR6 site and a PR3 site and claim &#8220;ah, the PR6 site is much better&#8221;. Get a lot of high-PR links and you&#8217;ll get high PR&#8230; but that&#8217;s about it. It doesn&#8217;t indicate any ranking in the SERP. You can have a PR6 homepage with no ranking while that PR3 homepage has ranking. Plus, considering live.com&#8217;s performance&#8230; well, I wouldn&#8217;t consider Microsoft too much of an authority regarding search technologies :)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: wolf</title>
		<link>http://www.searchenginejournal.com/microsoft-research-hits-on-googles-page-rank/7368/comment-page-1/#comment-1083109</link>
		<dc:creator>wolf</dc:creator>
		<pubDate>Sat, 26 Jul 2008 14:52:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.searchenginejournal.com/microsoft-research-hits-on-googles-page-rank/7368/#comment-1083109</guid>
		<description>The FAROO P2P Search Engine has been doing something very similar for some time already.
http://www.faroo.com/english/technology/architecture.html

FAROO&#039;s &quot;If users spend a long time on a page, visit it often, put it to bookmarks or print it out, this page goes up in ranking.&quot;
http://altsearchengines.com/2007/10/02/great-debate-peer-to-peer-p2p-search-part-i/
sounds very similar to Microsoft&#039;s
&quot;The more visits of the page made by the users and the longer time periods spent by the users on the page, the more likely the page is important.&quot;
http://research.microsoft.com/users/tyliu/files/fp032-Liu.pdf
doesn&#039;t it?

A very significant difference, though, is that FAROO maintains the privacy of the user because it calculates the PeerRank in a decentralized manner, while Microsoft would collect all click streams of all users on a central server.

It&#039;s great to see that the Microsoft Research paper confirms that attention-based ranking is able to outperform PageRank both for relevancy and for spam suppression.</description>
		<content:encoded><![CDATA[<p>The FAROO P2P Search Engine has been doing something very similar for some time already.<br />
<a href="http://www.faroo.com/english/technology/architecture.html" rel="nofollow">http://www.faroo.com/english/technology/architecture.html</a></p>
<p>FAROO&#8217;s &#8220;If users spend a long time on a page, visit it often, put it to bookmarks or print it out, this page goes up in ranking.&#8221;<br />
<a href="http://altsearchengines.com/2007/10/02/great-debate-peer-to-peer-p2p-search-part-i/" rel="nofollow">http://altsearchengines.com/2007/10/02/great-debate-peer-to-peer-p2p-search-part-i/</a><br />
sounds very similar to Microsoft&#8217;s<br />
&#8220;The more visits of the page made by the users and the longer time periods spent by the users on the page, the more likely the page is important.&#8221;<br />
<a href="http://research.microsoft.com/users/tyliu/files/fp032-Liu.pdf" rel="nofollow">http://research.microsoft.com/users/tyliu/files/fp032-Liu.pdf</a><br />
doesn&#8217;t it?</p>
<p>A very significant difference, though, is that FAROO maintains the privacy of the user because it calculates the PeerRank in a decentralized manner, while Microsoft would collect all click streams of all users on a central server.</p>
<p>It&#8217;s great to see that the Microsoft Research paper confirms that attention-based ranking is able to outperform PageRank both for relevancy and for spam suppression.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: gfigg</title>
		<link>http://www.searchenginejournal.com/microsoft-research-hits-on-googles-page-rank/7368/comment-page-1/#comment-1083087</link>
		<dc:creator>gfigg</dc:creator>
		<pubDate>Fri, 25 Jul 2008 23:48:43 +0000</pubDate>
		<guid isPermaLink="false">http://www.searchenginejournal.com/microsoft-research-hits-on-googles-page-rank/7368/#comment-1083087</guid>
		<description>Very interesting, although this has been tried before. DirectHit had a search engine built entirely on clickstream data (acquired by Ask.com in 2000). They got the data from ISPs in those days. The end result is really not that much better than PageRank.

We at Me.dium, on the other hand (http://me.dium.com/search), are processing our users&#039; clickstream data in real time to create a different lens based on what&#039;s going on now. For example, do a search for John Edwards on Google or Live, and you get johnedwards.com and wiki/johnedwards. Do the same search on Me.dium and you learn that today people care about his love child, pictures of his mistress, etc.

The difference is real-time (what people are browsing now) vs. historical (what they browsed in the past).  Social vs. Old School.  Check it out and let us know your thoughts.  http://me.dium.com/search.</description>
		<content:encoded><![CDATA[<p>Very interesting, although this has been tried before. DirectHit had a search engine built entirely on clickstream data (acquired by Ask.com in 2000). They got the data from ISPs in those days. The end result is really not that much better than PageRank.</p>
<p>We at Me.dium, on the other hand (<a href="http://me.dium.com/search" rel="nofollow">http://me.dium.com/search</a>), are processing our users&#8217; clickstream data in real time to create a different lens based on what&#8217;s going on now. For example, do a search for John Edwards on Google or Live, and you get johnedwards.com and wiki/johnedwards. Do the same search on Me.dium and you learn that today people care about his love child, pictures of his mistress, etc.</p>
<p>The difference is real-time (what people are browsing now) vs. historical (what they browsed in the past).  Social vs. Old School.  Check it out and let us know your thoughts.  <a href="http://me.dium.com/search" rel="nofollow">http://me.dium.com/search</a>.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

