<?xml version="1.0" encoding="UTF-8" ?>
<?xml-stylesheet type="text/xsl" media="screen" href="/styles/xslt/rss.xslt"?>
<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:trackback="http://madskills.com/public/xml/rss/module/trackback/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:media="http://search.yahoo.com/mrss/" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:c9="http://channel9.msdn.com">
<channel>
	<title>Channel 9 Forums - Tech Off - &quot;Searching &#39;Samson&#39;......did you mean &#39;Sampson&#39;?&quot;</title>
	<atom:link rel="self" type="application/rss+xml" href="http://channel9.msdn.com/Forums/rss"></atom:link>
	<image>
		<url>http://mschnlnine.vo.llnwd.net/d1/Dev/App_Themes/C9/images/feedimage.png</url>
		<title>Channel 9 Forums - Tech Off - &quot;Searching &#39;Samson&#39;......did you mean &#39;Sampson&#39;?&quot;</title>
		<link>http://channel9.msdn.com/Forums</link>
	</image>
	<description>Channel 9 keeps you up to date with the latest news and behind-the-scenes info from Microsoft that developers love to follow. From LINQ to Silverlight – watch videos and hear about all the cool technologies coming and the people behind them.</description>
	<link>http://channel9.msdn.com/Forums</link>
	<language>en</language>
	<pubDate>Wed, 19 Jun 2013 02:28:53 GMT</pubDate>
	<lastBuildDate>Wed, 19 Jun 2013 02:28:53 GMT</lastBuildDate>
	<generator>Rev9</generator>
	<c9:totalResults>5</c9:totalResults>
	<c9:pageCount>-5</c9:pageCount>
	<c9:pageSize>-1</c9:pageSize>
	<item>
		<title>Tech Off - &quot;Searching &#39;Samson&#39;......did you mean &#39;Sampson&#39;?&quot;</title>
		<description><![CDATA[<p>I'm about to start writing a small search engine for my site, and I want to do more to help users find the results they're after.<br /><br />I have considered soundex() operations, but I've heard they aren't always reliable, since longer, more complex soundex() results may be truncated.<br /><br />I've also looked into Levenshtein distance, but it has problems too. To have a search engine suggest alternatives based on their Levenshtein distance from the original search term, you need a list of terms waiting to be compared against everybody's search term, and that could be nasty on the server: running a Levenshtein calculation 1,000+ times every time somebody submits a search.<br /><br />Even if you do the tiresome lookups, you'll get dozens of alternatives for each word. How do you decide which to suggest? It seems like I need to store search terms, along with the number of results each one returned, in the database, run Levenshtein comparisons against those, and sort the close ones by the number of results they last returned. Make sense?<br /><br />So I humble myself and come before you all today <img src='http://ecn.channel9.msdn.com/o9/content/images/emoticons/emotion-5.gif' alt='Wink' /> Does anybody know how to do this in an efficient way?<br /></p>]]></description>
		<link>http://channel9.msdn.com/Forums/TechOff/261720-quotSearching-Samsondid-you-mean-Sampsonquot/261720#261720</link>
		<pubDate>Wed, 19 Mar 2008 14:49:24 GMT</pubDate>
		<guid isPermaLink="false">http://channel9.msdn.com/Forums/TechOff/261720-quotSearching-Samsondid-you-mean-Sampsonquot/261720#261720</guid>
		<dc:creator>jsampsonPC</dc:creator>
		<slash:comments>5</slash:comments>
		<wfw:commentRss>http://channel9.msdn.com/Niners/jsampsonPC/Discussions/RSS</wfw:commentRss>
	</item>
	<item>
		<title>Tech Off - &quot;Searching &#39;Samson&#39;......did you mean &#39;Sampson&#39;?&quot;</title>
		<description><![CDATA[<p>I had to implement something similar not too long ago, and I also had to do percentage matching. In my case, though, I couldn't change the underlying database, and I was lucky enough to be able to narrow the results down to around 100 or so before having to calculate anything. I ended up settling on a combination of Levenshtein and Double Metaphone.<br /><br />If you can add a column to the underlying table, you might want to look at calculating the Double Metaphone once and storing the hashes it creates in a couple of columns. Then you can compare the Double Metaphone hashes in SQL instead of calculating them on the fly.<br /><br />Here's an article where a guy did it and stored the hash as a UDT.<br /><a href="http://www.sqlservercentral.com/articles/.Net/doublemetaphonephonecticmatching/2063/">http://www.sqlservercentral.com/articles/.Net/doublemetaphonephonecticmatching/2063/</a><br /><br /></p>]]></description>
		<link>http://channel9.msdn.com/Forums/TechOff/261720-quotSearching-Samsondid-you-mean-Sampsonquot/78862234002a488eb8409dfa00c1c99c#78862234002a488eb8409dfa00c1c99c</link>
		<pubDate>Wed, 19 Mar 2008 17:45:27 GMT</pubDate>
		<guid isPermaLink="false">http://channel9.msdn.com/Forums/TechOff/261720-quotSearching-Samsondid-you-mean-Sampsonquot/78862234002a488eb8409dfa00c1c99c#78862234002a488eb8409dfa00c1c99c</guid>
		<dc:creator>Antitorgo</dc:creator>
		<slash:comments>5</slash:comments>
		<wfw:commentRss>http://channel9.msdn.com/Niners/Antitorgo/Discussions/RSS</wfw:commentRss>
	</item>
	<item>
		<title>Tech Off - &quot;Searching &#39;Samson&#39;......did you mean &#39;Sampson&#39;?&quot;</title>
		<description><![CDATA[<p>Levenshtein is not only useful on words, and there are improved Soundex ideas about. You will have a lot fewer sound codes than words <img src='http://ecn.channel9.msdn.com/o9/content/images/emoticons/emotion-1.gif' alt='Smiley' /> I'd write more, but I'm on my phone at the minute.</p>]]></description>
		<link>http://channel9.msdn.com/Forums/TechOff/261720-quotSearching-Samsondid-you-mean-Sampsonquot/448123a16ef84d3395e29dfa00c1ca12#448123a16ef84d3395e29dfa00c1ca12</link>
		<pubDate>Wed, 19 Mar 2008 21:01:47 GMT</pubDate>
		<guid isPermaLink="false">http://channel9.msdn.com/Forums/TechOff/261720-quotSearching-Samsondid-you-mean-Sampsonquot/448123a16ef84d3395e29dfa00c1ca12#448123a16ef84d3395e29dfa00c1ca12</guid>
		<dc:creator>Rossj</dc:creator>
		<slash:comments>5</slash:comments>
		<wfw:commentRss>http://channel9.msdn.com/Niners/Rossj/Discussions/RSS</wfw:commentRss>
	</item>
	<item>
		<title>Tech Off - &quot;Searching &#39;Samson&#39;......did you mean &#39;Sampson&#39;?&quot;</title>
		<description><![CDATA[<p>Use a mixture of Soundex (or a variant) and the Levenshtein function. Essentially, the Soundex function tries to combat a person not knowing how a word is spelt (but knowing how it sounds), and a Levenshtein distance algorithm tries to combat typos.<br /></p>]]></description>
		<link>http://channel9.msdn.com/Forums/TechOff/261720-quotSearching-Samsondid-you-mean-Sampsonquot/df55de9f905d406990109dfa00c1ca85#df55de9f905d406990109dfa00c1ca85</link>
		<pubDate>Thu, 20 Mar 2008 02:25:04 GMT</pubDate>
		<guid isPermaLink="false">http://channel9.msdn.com/Forums/TechOff/261720-quotSearching-Samsondid-you-mean-Sampsonquot/df55de9f905d406990109dfa00c1ca85#df55de9f905d406990109dfa00c1ca85</guid>
		<dc:creator>evildictaitor</dc:creator>
		<slash:comments>5</slash:comments>
		<wfw:commentRss>http://channel9.msdn.com/Niners/evildictaitor/Discussions/RSS</wfw:commentRss>
	</item>
	<item>
		<title>Tech Off - &quot;Searching &#39;Samson&#39;......did you mean &#39;Sampson&#39;?&quot;</title>
		<description><![CDATA[<p>Googling for Double Metaphone will probably find you a better Soundex. Find the matching codes (primary and secondary for Double Metaphone, primary for Metaphone), weight them according to popularity (if you have the info), and then order them by edit distance.
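<div><br /></div>
<div>A minimal sketch of the steps above, not a definitive implementation. It assumes plain Soundex as a stand-in for Double Metaphone (which is too long to reproduce here), and the names build_index, suggest and term_counts, along with the sample data, are made up for illustration. Candidates that share a phonetic code are ordered by edit distance, with popularity breaking ties, which is only one way to combine the two signals.</div>
<pre>
```python
from collections import defaultdict

# Standard American Soundex digit groups.
SOUNDEX_DIGITS = {}
for letters, digit in (("BFPV", "1"), ("CGJKQSXZ", "2"), ("DT", "3"),
                       ("L", "4"), ("MN", "5"), ("R", "6")):
    for letter in letters:
        SOUNDEX_DIGITS[letter] = digit


def soundex(word):
    """Return the 4-character Soundex code, or '' if the word has no letters."""
    letters = [c for c in word.upper() if c.isalpha()]
    if not letters:
        return ""
    code = letters[0]
    previous = SOUNDEX_DIGITS.get(letters[0], "")
    for c in letters[1:]:
        if c in "HW":
            continue  # H and W do not separate letters that share a digit
        digit = SOUNDEX_DIGITS.get(c, "")
        if digit and digit != previous:
            code += digit
        previous = digit  # a vowel resets this, so a repeat across a vowel counts
    return (code + "000")[:4]


def levenshtein(a, b):
    """Classic dynamic-programming edit distance, computed one row at a time."""
    previous_row = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current_row = [i]
        for j, char_b in enumerate(b, start=1):
            substitution = previous_row[j - 1] + (char_a != char_b)
            current_row.append(min(previous_row[j] + 1,     # deletion
                                   current_row[j - 1] + 1,  # insertion
                                   substitution))
        previous_row = current_row
    return previous_row[-1]


def build_index(term_counts):
    """Group known terms by Soundex code so a lookup avoids scanning every term."""
    index = defaultdict(list)
    for term, count in term_counts.items():
        index[soundex(term)].append((term, count))
    return index


def suggest(query, index, limit=3):
    """Rank same-code terms by edit distance, then by popularity (highest first)."""
    candidates = index.get(soundex(query), [])
    ranked = sorted(
        (pair for pair in candidates if pair[0].lower() != query.lower()),
        key=lambda pair: (levenshtein(query.lower(), pair[0].lower()), -pair[1]),
    )
    return [term for term, _ in ranked[:limit]]


# Hypothetical data: known terms mapped to a popularity count.
term_counts = {"Smith": 120, "Smyth": 8, "Schmidt": 40, "Jones": 75}
index = build_index(term_counts)
print(suggest("Smiht", index))
```
</pre>
<div>Because the index is keyed by phonetic code, each search runs the edit-distance calculation only against the handful of terms in one bucket rather than the whole term list. Note that plain Soundex gives "Samson" and "Sampson" different codes (S525 and S512), so a bucket lookup alone would miss that pair even though their edit distance is 1. A fallback edit-distance pass over popular terms may still be worth having.</div>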
<div><br /></div>
<div>Don't forget to stem the words (look for an alternative to the Porter stemmer if you have time), and try to add more weight to proper nouns and nouns, a bit of weight to verbs, and a negative discriminating weight to determiners.</div>
<div><br /></div>
<div>Then add a neural net so that you can track the best matches (based on which links the user clicked) and feed that back in to improve accuracy in future searches.</div></p>]]></description>
		<link>http://channel9.msdn.com/Forums/TechOff/261720-quotSearching-Samsondid-you-mean-Sampsonquot/7fe2d283d93d4c6b96249dfa00c1cafe#7fe2d283d93d4c6b96249dfa00c1cafe</link>
		<pubDate>Thu, 20 Mar 2008 14:23:22 GMT</pubDate>
		<guid isPermaLink="false">http://channel9.msdn.com/Forums/TechOff/261720-quotSearching-Samsondid-you-mean-Sampsonquot/7fe2d283d93d4c6b96249dfa00c1cafe#7fe2d283d93d4c6b96249dfa00c1cafe</guid>
		<dc:creator>Rossj</dc:creator>
		<slash:comments>5</slash:comments>
		<wfw:commentRss>http://channel9.msdn.com/Niners/Rossj/Discussions/RSS</wfw:commentRss>
	</item>
</channel>
</rss>