<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Global by Design &#187; Multilingual search</title>
	<atom:link href="http://www.globalbydesign.com/blog/category/multilingual-search/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.globalbydesign.com</link>
	<description>Adventures in Web Globalization</description>
	<lastBuildDate>Thu, 02 Sep 2010 15:47:19 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.1</generator>
<cloud domain='www.globalbydesign.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
		<item>
		<title>Decyphering Google Translate on your web logs</title>
		<link>http://www.globalbydesign.com/blog/2009/08/08/google-translate-web-logs/</link>
		<comments>http://www.globalbydesign.com/blog/2009/08/08/google-translate-web-logs/#comments</comments>
		<pubDate>Sat, 08 Aug 2009 17:51:55 +0000</pubDate>
		<dc:creator>John Yunker</dc:creator>
				<category><![CDATA[Google]]></category>
		<category><![CDATA[Languages]]></category>
		<category><![CDATA[Machine Translation]]></category>
		<category><![CDATA[Multilingual search]]></category>

		<guid isPermaLink="false">http://www.globalbydesign.com/?p=1957</guid>
		<description><![CDATA[Whenever I read this site&#8217;s web logs, I&#8217;m always fascinated by the number of referrals via Google Translate. Every month there seems to be more of them, which could mean that the quality of Google Translate is improving, or this site is doing better in the rankings, or some combination of the two. Or, it [...]]]></description>
			<content:encoded><![CDATA[<p>Whenever I read this site&#8217;s web logs, I&#8217;m always fascinated by the number of referrals via Google Translate.</p>
<p>Every month there seems to be more of them, which could mean that the quality of Google Translate is improving, or this site is doing better in the rankings, or some combination of the two. Or, it could be simply be that more people have discovered Google Translate.</p>
<p>Given my passion for <a href="http://bytelevel.com/map/ccTLD.html" target="_blank">country codes</a>, it&#8217;s fair to say that I also enjoy language codes. And it is through language codes that you can figure out what languages users were translating your site &#8220;from&#8221; and &#8220;to.&#8221;</p>
<p>Here is one referral string from my site:</p>
<p><img class="alignnone size-full wp-image-1961" style="border: 1px solid black;" title="google_translate" src="http://www.globalbydesign.com/wp-content/uploads/2009/08/google_translate.jpg" alt="google translate Decyphering Google Translate on your web logs" width="518" height="64" /></p>
<p>First, you can see that the person was using Google Korea, so it&#8217;s fair to say the person was translating from English into Korean. The &#8220;To&#8221; line is actually the blog title post translated into Korean.</p>
<p>That was an easy one.</p>
<p>This next one is a bit more challenging:</p>
<p><img class="alignnone size-full wp-image-1962" style="border: 1px solid black;" title="google_translate2" src="http://www.globalbydesign.com/wp-content/uploads/2009/08/google_translate2.jpg" alt="google translate2 Decyphering Google Translate on your web logs" width="523" height="75" /></p>
<p>This person was using Google.com, so you have to focus on the language codes. There are two here &#8212; an &#8220;id&#8221; (which follows  &#8220;hl=&#8221;) and an &#8220;en&#8221; (which follows &#8220;sl=&#8221;). What this means is the person was translating from English into Indonesian (Bahasa Indonesia).</p>
<p>Here is what the translated page looks like:</p>
<p><img class="alignnone size-full wp-image-1963" style="border: 1px solid black;" title="google_translate2a" src="http://www.globalbydesign.com/wp-content/uploads/2009/08/google_translate2a.jpg" alt="google translate2a Decyphering Google Translate on your web logs" width="500" height="540" /></p>
<p>The quick and easy way to know the target language is to focus on the &#8220;hl=&#8221; string. In the screen shot below, the target language is German.</p>
<p><img class="alignnone size-full wp-image-1964" style="border: 1px solid black;" title="google_translate3" src="http://www.globalbydesign.com/wp-content/uploads/2009/08/google_translate3.jpg" alt="google translate3 Decyphering Google Translate on your web logs" width="523" height="75" /></p>
<p>And here is a <a href="http://www.loc.gov/standards/iso639-2/php/code_list.php" target="_blank">language code reference</a> if you want to study your web logs.</p>
<p>What I want to know is what percentage of web traffic is taken up by Google Translate. Anyone care to share their Web log stats?</p>
<p>Based on my cursory analysis, I would estimate the figure to be between 5% and 10%, but that&#8217;s very rough.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.globalbydesign.com/blog/2009/08/08/google-translate-web-logs/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Google vs. Baidu: A User Experience Analysis</title>
		<link>http://www.globalbydesign.com/blog/2008/05/02/google-vs-baidu-a-user-experience-analysis/</link>
		<comments>http://www.globalbydesign.com/blog/2008/05/02/google-vs-baidu-a-user-experience-analysis/#comments</comments>
		<pubDate>Fri, 02 May 2008 01:23:31 +0000</pubDate>
		<dc:creator>Jason Yu</dc:creator>
				<category><![CDATA[China]]></category>
		<category><![CDATA[Culture]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Guest Articles]]></category>
		<category><![CDATA[Languages]]></category>
		<category><![CDATA[Multilingual search]]></category>
		<category><![CDATA[Software Localization]]></category>
		<category><![CDATA[Web Globalization]]></category>

		<guid isPermaLink="false">http://www.globalbydesign.com/?p=1182</guid>
		<description><![CDATA[There are tons of articles about Google vs. Baidu, but few of these articles take an in-depth look at how Google compares to Baidu from a Chinese user&#8217;s perspective. In this article, I do just that, and I render a verdict as to which Web site is better. Search The best way to compare search [...]]]></description>
			<content:encoded><![CDATA[<p class="MsoNormal" style="0.0001pt;"><span style="Arial;">There are tons of articles about Google vs. Baidu, but few of these articles take an in-depth look at how Google compares to Baidu from a Chinese user&#8217;s perspective. </span></p>
<p class="MsoNormal" style="0.0001pt;"><span style="Arial;">In this article, I do just that, and I render a verdict as to which Web site is better.</span></p>
<p class="MsoNormal" style="0.0001pt;"><strong><span style="Arial;">Search</span></strong></p>
<p class="MsoNormal" style="0.0001pt;"><span style="Arial;">The best way to compare search engine quality is to compare searches.</span></p>
<p class="MsoNormal" style="0.0001pt;"><span style="Arial;">I recently input three Chinese keywords for my experiment: </span></p>
<ul>
<li>许霆 (Xu Ting: A Chinese citizen who was recently involved in a controversial criminal case)</li>
<li>次级房贷 (Subprime mortgage)</li>
<li>看羹吃饭 (Kan-Geng-Chi-fan: A phrase used and recognized by a relatively small number of Chinese, meaning that you have to think carefully before taking action)</li>
</ul>
<p class="MsoNormal" style="0.0001pt;"><span style="Arial;">These keywords represent three different categories of information people search for online. <strong>Xu Ting</strong> is a hot keyword in China at the moment but it has received little international media coverage. <strong>Subprime mortgage</strong>, on the other hand, is a foreign concept and the term has been transliterated into Chinese characters from the English equivalent. <strong>Kan-Geng-Chi-fan</strong> is used within a specific dialect that is not used by the majority of Chinese citizens. </span></p>
<p class="MsoNormal" style="0.0001pt;"><span style="Arial;">Okay, here are the results as of April 18, 2008: </span></p>
<p class="MsoNormal" style="0.0001pt;"><strong><span style="Arial;">&#8220;Xu Ting&#8221;</span></strong></p>
<ul>
<li><span style="Arial;"><a href="http://www.baidu.com/s?ie=gb2312&amp;bs=%B4%CE%BC%B6%B7%BF%B4%FB&amp;sr=&amp;z=&amp;cl=3&amp;f=8&amp;wd=%D0%ED%F6%AA&amp;ct=0"><span style="#174dae;">Baidu</span></a>: 2,000,000 results</span></li>
<li><span style="Arial;"><a href="http://www.google.com/search?q=%E8%AE%B8%E9%9C%86&amp;hl=en&amp;start=0&amp;sa=N"><span style="#174dae;">Google.com</span></a>: 1,440,000 results</span></li>
<li><span style="Arial;"><a href="http://www.google.cn/search?q=%E8%AE%B8%E9%9C%86&amp;complete=1&amp;hl=zh-CN&amp;inlang=zh-CN&amp;start=0&amp;sa=N"><span style="#174dae;">Google.cn</span></a>: 1,330,000 results</span></li>
</ul>
<p class="MsoNormal" style="0.0001pt;"><span style="Arial;">It would seem that Baidu knows much more about Xu Ting than Google, although I did not verify that every result referred to this particular individual. </span></p>
<p class="MsoNormal" style="0.0001pt;"><span style="Arial;">Interestingly, in the first results page of both <a href="http://google.com/"><span style="#174dae;">google.com</span></a> and <a href="http://google.cn/"><span style="#174dae;">google.cn</span></a>, one of the search results directed users to <a href="http://www.blogger.com/tieba.baidu.com/f?kw=%D0%ED%F6%AA"><span style="#174dae;">Baidu Post</span></a> &#8212; Baidu&#8217;s popular user forum.</span></p>
<p class="MsoNormal" style="0.0001pt;"><span style="Arial;">Overall, I would rate both sites equally because the top 20 results from each search engine were highly qualified and I could easily find information I wanted from there. <strong>Verdict: A tie.</strong></span></p>
<p class="MsoNormal" style="0.0001pt;">
<p class="MsoNormal" style="0.0001pt;"><strong><span style="Arial;">&#8220;Subprime mortgage&#8221; </span></strong></p>
<ul>
<li><span style="Arial;"><a href="http://www.baidu.com/s?ie=gb2312&amp;bs=%D0%ED%F6%AA&amp;sr=&amp;z=&amp;cl=3&amp;f=8&amp;wd=%B4%CE%BC%B6%B7%BF%B4%FB&amp;ct=0"><span style="#174dae;">Baidu</span></a>: 1,050,000 results</span></li>
<li><span style="Arial;"><a href="http://www.google.com/search?hl=en&amp;q=%E6%AC%A1%E7%BA%A7%E6%88%BF%E8%B4%B7&amp;btnG=Search"><span style="#174dae;">Google.com</span></a>: 387,000 results</span></li>
<li><span style="Arial;"><a href="http://www.google.cn/search?aq=f&amp;complete=1&amp;hl=zh-CN&amp;inlang=zh-CN&amp;q=%E6%AC%A1%E7%BA%A7%E6%88%BF%E8%B4%B7&amp;btnG=Google+%E6%90%9C%E7%B4%A2&amp;meta="><span style="#174dae;">Google.cn</span></a>: 1,540,000 results</span></li>
</ul>
<p class="MsoNormal" style="0.0001pt;"><span style="Arial;">This time <a href="http://google.cn/"><span style="#174dae;">google.cn</span></a> appears to do much better than Baidu. But if we look closely at the top 20 search results, we&#8217;ll find there are 7 results at <a href="http://google.com/"><span style="#174dae;">google.com</span></a> and 5 results at <a href="http://google.cn/"><span style="#174dae;">google.cn</span></a> that direct us to Web sites that use traditional Chinese characters, which are used in Taiwan, Hong Kong and by the overseas Chinese community. </span></p>
<p class="MsoNormal" style="0.0001pt;"><span style="Arial;">It can be rather challenging for the mainland Chinese to read traditional Chinese, though they can understand most of the message. Nonetheless, this mix of simplified and traditional Characters is not the most user-friendly approach. <strong>Verdict: Baidu wins.</strong></span></p>
<p class="MsoNormal" style="0.0001pt;">
<p class="MsoNormal" style="0.0001pt;">
<p class="MsoNormal" style="0.0001pt;"><strong><span style="Arial;">&#8220;Kan Geng Chi Fan&#8221;</span></strong></p>
<ul>
<li><span style="Arial;"><a href="http://www.baidu.com/s?wd=%BF%B4%B8%FE%B3%D4%B7%B9"><span style="#174dae;">Baidu.com</span></a> 207 results</span></li>
<li><span style="Arial;"><a href="http://www.google.com/search?hl=en&amp;q=%E7%9C%8B%E7%BE%B9%E5%90%83%E9%A5%AD&amp;btnG=Google+Search"><span style="#174dae;">Google.com</span></a> 4,000,000 results</span></li>
<li><span style="Arial;"><a href="http://www.google.cn/search?aq=f&amp;complete=1&amp;hl=zh-CN&amp;inlang=zh-CN&amp;q=%E7%9C%8B%E7%BE%B9%E5%90%83%E9%A5%AD&amp;btnG=Google+%E6%90%9C%E7%B4%A2&amp;meta="><span style="#174dae;">Google.cn</span></a> 247,000 results</span></li>
</ul>
<p class="MsoNormal" style="0.0001pt;"><span style="Arial;">At first glance, Google produced overwhelmingly more information than Baidu. However, if we examine the details, Google did not perform so well. Neither Google.com nor Google.cn produce an accurate search result within the first 10 pages respectively, while all the 207 search results from Baidu are accurate. <strong>Verdict: Baidu wins again.</strong> </span></p>
<p class="MsoNormal" style="0.0001pt;"><span style="Arial;">Based on these three searches, Google comes across as a bit complicated and &#8220;foreign&#8221; to Chinese users. Baidu is the superior Chinese search engine.</span></p>
<p class="MsoNormal" style="0.0001pt;">
<p class="MsoNormal" style="0.0001pt;"><strong><span style="Arial;">Products</span></strong></p>
<p class="MsoNormal" style="0.0001pt;"><span style="Arial;">Both Google and Baidu are trying to leverage their network effects to promote other products. Google has many excellent products, but not every product has performed well in China. For example, Google Maps is widely used by American users. Unfortunately, Google Maps in China is unable to provide the same features due to unavailability of mapping data in China. Google&#8217;s satellite map currently only covers the major Chinese cities. Should Google acquire better maps, it would have a clear advantage over Baidu, which doesn&#8217;t offer the same degree of functionality and usability in its map tool.<br />
</span></p>
<p class="MsoNormal" style="0.0001pt;"><span style="Arial;">Although music copyright is a controversial issue within China, the market reality is that millions of Chinese Internet users download free music online. Baidu understands this reality and its music search product &#8212; which presents a list of links for free music downloads when people search by song, singer, or label &#8212; is extremely popular. Google is unable to compete with Baidu in this regard due to its adherence to US copyright laws.</span></p>
<p class="MsoNormal" style="0.0001pt;"><span style="Arial;">Another example is Baidu Post, an online forum allowing Internet user to create new topics based on search keywords and provide commentary. When people search online by keyword, they can also follow these keywords to Baidu Post, where they may find additional information &#8212; or at least find out what others think of the selected keywords. </span></p>
<p class="MsoNormal" style="0.0001pt;"><span style="Arial;">Online forums are a very important medium in China for distributing information online. I think an important reason for this is because the Chinese, as well as many businesses, want to remain anonymous. While this may change in the years ahead as the next generation embraces social networking sites, for the time being, online forums are dominant. Baidu also offers a blog platform (Hi Baidu) while Google has localized Blogger into Chinese, very few Chinese people currently use it.<br />
</span></p>
<p class="MsoNormal" style="0.0001pt;"><span style="Arial;">Local culture and consumer behavior are critical factors in determining whether a product will succeed in an overseas market or not. So far, Google products have not been as appealing as Baidu to Chinese users.</span></p>
<p class="MsoNormal" style="0.0001pt;"><strong><span style="Arial;">The Brand Name</span></strong></p>
<p class="MsoNormal" style="0.0001pt;"><span style="Arial;">The name of Baidu (</span><span>百度</span><span style="Arial;">) is from a beautiful Chinese ancient poem:</span></p>
<p class="MsoNormal" style="0.0001pt;"><span style="Arial;"> </span></p>
<blockquote>
<p class="MsoNormal" style="text-align: left;"><strong><span style="Arial;">Thousands of times</span></strong><span style="Arial;">, I looked for my girl;</span></p>
<p class="MsoNormal" style="text-align: left;"><span style="Arial;">Suddenly, at some point, I stopped and looked back,</span></p>
<p class="MsoNormal" style="text-align: left;"><span style="Arial;">I found she was just over there among a bunch of lanterns.</span></p>
</blockquote>
<p class="MsoNormal" style="0.0001pt;"><span style="Arial;"> This poem, written by Qiji Xin, who lived in the Song Dynasty nearly 1000 years ago, is still very popular in China and also taught in high schools. Baidu in Chinese means <strong>thousands of times</strong>. In Chinese culture, this poem communicates one&#8217;s desire to achieve his/her dreams. Obviously, meshes well with the services offered by Baidu, a company that claims it better understands Chinese users and Chinese culture.</span></p>
<p class="MsoNormal" style="0.0001pt;"><span style="Arial;">Google started to use its Chinese name Guge (</span><span>谷歌</span><span style="Arial;">) in 2006. Guge (goo-ge) is transliterated from Google and it literally means &#8220;the song of grain&#8221; in Chinese. <a href="http://tech.sina.com.cn/i/2006-04-21/1638913602.shtml"><span style="#174dae;">A survey</span></a> conducted in 2006 shows 84.6% Chinese do not like this name. I think the most important reason is that Chinese people want to feel international and modern. This is also one reason you may see many Chinese companies using English words in their marketing materials, as it creates an international effect. The &#8220;song of grain&#8221; presents an image of the agricultural society that the Chinese people are striving to break away from.</span></p>
<p class="MsoNormal" style="0.0001pt;"><span style="Arial;">Google has exerted a good deal of effort in localizing its name for China but it has not yet been accepted by the Chinese people. It may take some time. Some companies have chosen to simply use their English names in China, avoiding localization altogether, such as IBM. </span></p>
<p class="MsoNormal" style="0.0001pt;"><span style="Arial;">To sum up, Baidu definitely has an edge over Google in China. But it is early yet and Google has been doing things such as redesigning its Chinese home page, which may resonate with users. The key takeaway here is that every new market is a new challenge; just because you are number one at home does not mean you will be number one in every country you enter. Should Baidu enter the US market some day, it will face many of the same challenges that Google is now facing in China.</span></p>
]]></content:encoded>
			<wfw:commentRss>http://www.globalbydesign.com/blog/2008/05/02/google-vs-baidu-a-user-experience-analysis/feed/</wfw:commentRss>
		<slash:comments>12</slash:comments>
		</item>
		<item>
		<title>Understanding the global search market</title>
		<link>http://www.globalbydesign.com/blog/2007/10/17/understanding-the-global-search-market/</link>
		<comments>http://www.globalbydesign.com/blog/2007/10/17/understanding-the-global-search-market/#comments</comments>
		<pubDate>Wed, 17 Oct 2007 18:09:36 +0000</pubDate>
		<dc:creator>John Yunker</dc:creator>
				<category><![CDATA[China]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Multilingual search]]></category>
		<category><![CDATA[Russia]]></category>

		<guid isPermaLink="false">http://www.globalbydesign.com/blog/2007/10/17/understanding-the-global-search-market/</guid>
		<description><![CDATA[The fine folks at Multlingual Search have published a report on the state of the search engine markets in more than a dozen countries. This report is a quick read and, best of all, it&#8217;s free. You can download it here. The most interesting takeaway is something I&#8217;ve been writing about on this blog &#8212; [...]]]></description>
			<content:encoded><![CDATA[<p>The fine folks at Multlingual Search have published a report on the state of the search engine markets in more than a dozen countries.</p>
<p>This report is a quick read and, best of all, it&#8217;s free. You can download it <a href="http://www.multilingual-search.com/global-search-report-2007-released-free-pdf/15/10/2007" target="_blank">here</a>.</p>
<p>The most interesting takeaway is something I&#8217;ve been writing about on this blog &#8212; that while Google is by far the leading search engine globally, it still does not lead in a number of key markets, namely China, Russia, Estonia, Czech Republic, and South Korea.</p>
<p>According to this report, Google did not even make the list of top three search engines in South Korea; I&#8217;m wondering if Google&#8217;s <a href="http://www.globalbydesign.com/blog/2007/06/01/google-gets-animated-in-korea/">recent redesign of its Korea site</a> is beginning to help it make up ground.</p>
<p>Somewhat related is a new press release from <a href="http://www.comscore.com/press/release.asp?press=1802" target="_blank">comScore</a> that &#8220;found that more than 750 million people age 15 and older  or 95 percent of the worldwide Internet audience  <strong>conducted 61 billion searches worldwide in August</strong>, an average of more than 80 searches per searcher.&#8221;</p>
<p>Here is the breakdown of these searches by region:</p>
<p><img src="http://www.globalbydesign.com/wp-content/uploads/2007/10/comscore_oct07.gif" alt="comscore oct07 Understanding the global search market " border="1" title="Understanding the global search market " /></p>
]]></content:encoded>
			<wfw:commentRss>http://www.globalbydesign.com/blog/2007/10/17/understanding-the-global-search-market/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>
