標籤:

傳譯來自利比亞的民眾之聲

新科學家雜誌:沙漠綠洲之城奧庫法拉赫坐落於利比亞境內的東南遠端深入撒哈拉沙漠,這裡本來不會是外國新聞記者矚目的地方,但是在2月23日一條該城抗議者通過手機傳送的語音信息通過Google提供的語音轉化技術在推特上發表了,這條信息是:「年輕人已經控制了城市,他們升起了被卡扎菲政變推翻的前國家國旗。」這條信息之所以引人注目,是由於在2月23日前在利比亞還沒有一名西方記者存在,而這條消息讓世界上的人們知道了在利比亞發生的情況。這條語音消息是推特從Alive in Libya這個網站摘錄的,而在這個網站的背後是一支志願者大軍,他們負責把阿拉伯語轉譯為英語,而這種技術就是Google近年來所致力的機器與人工混合翻譯技術,這種混合翻譯技術比起純機器翻譯有了質的提高。Crowdsourced translations get the word out from Libya

  • 18:01 25 February 2011 byJim GilesandJacob Aron
  • The message is clear in any language (Image: Sven Torfinn/Panos)

    The oasis town of Al Khufrah lies deep in the Sahara desert in the far south-east of Libya. Lying almost 1000 kilometres from its nearest sizeable neighbour, it is not somewhere foreign journalists tend to visit.

    But on 23 February, news from the town reached the English-speaking world. "Greetings this is an urgent message from Kufra," said the anonymous source. "Young people have taken complete control of the city, they hoisted the flag of Libya and Gaddafi down the flag."

    The message arrived by an ingenious route. It started with a voice message in Arabic left on a phone line operated by Google. Software managing the line published the message on Twitter, from where it was picked up by the websiteAlive in Libya. The tweet went out to Alive"s army of volunteers, who provided an English translation for the site. It is just one of around 170 reports, from videos to tweets to audio recordings, that Alive in Libya has translated since it started on 19 February.

    The site has been an important resource because until 21 February there were no western reporters in Libya, notesAndy Carvin, a social media strategist at National Public Radio in Washington DC. "Their translation work has helped give more credibility to a number of sources, as well as providing reporters and the public with more context on any given situation."

    Fostering debate

    Crowdsourced translation like this is finding a growing number of applications. It was used in Haiti to translate messages sent in the aftermath of last year"s earthquake. "Alive in" projects are also active in Egypt and Bahrain. AtMeedan, a social network aimed at fostering debate between Arabic and English speakers, 9000 registered users help translate 300,000 words every month, says Ed Bice, the site founder. Facebook"s efforts to crowdsource translation of its site were so successful that it has given website owners access to the translation tools it created.

    Crowdsourcing is also being used to improve machine translation services likeGoogle Translate. Software tools like this are useful for quick and dirty translation, but they can make embarrassing mistakes. Try using Google to translate the phrase "Machine translation is good, but not great" from Japanese to English and back again, and the result is "Machine translation is not good, great" – clearly not the same thing. Sometimes the result is simply nonsense. Translating "I could murder a pint" to Spanish and back gets you "I could kill a liter".

    For people who can speak both the source and the destination language, such mistakes are easy to spot, so why not let bilingual humans lend the machines a hand?Adam Lopez, a machine translation researcher at Johns Hopkins University in Baltimore, Maryland, sees such teamwork as a step on the road to a universal translation system that will ensure material posted online can be read in any language. "This is really speculative, but a few decades from now, who knows?" he says.

    The building blocks for such a system are already in place. Posts in Arabic on Meedan are automatically translated into English, and vice versa, and then opened up to the site"s users, who tweak the translations as necessary. Even Google lets its users suggest corrections to translations. "When you refine a translation, we"ll take that and feed it back into the system," says Chewy Trewhella, new business development manager at Google. "Every time a user refines a search they"re helping in some way."

    Translate and learn

    To motivate people to provide translations, some researchers are trying to build an educational component into the process.Luis von Ahn, a computer scientist at Carnegie Mellon University in Pittsburgh, Pennsylvania, is developing a service calledDuolingodesigned to help people to learn a language while also generating useful translations. More details will be available when the service launches around two months from now.

    "The idea that monolingual speakers could improve machine translation started out as a crazy idea," saysPhilip Resnik, a computational linguist at the University of Maryland in College Park. His MonoTrans2 allows users reading the first draft of a computer translation to flag up phrases that appear incorrect. This information is then passed to people reading the text in the original language, inviting them to rephrase problem phrases in a way the machine might better understand. After the computer has had another try, the new phrase is flagged up, prompting readers to judge whether it is correct.

    Resnik tested the system by translating children"s books from Spanish into German. He found the number of sentences rated perfectly fluent and accurate by bilingual evaluators increased from 10 to 68 per cent when compared with Google Translate. It"s not perfect, but it shows what can be achieved when humans and machines work together. "People don"t really care whether the translations they"re getting are coming from a machine or not, as long as they get them when they want them," says Resnik.

    The common language of statistics

    The machine translation revolution began 20 years ago, when a research group at IBM introduced a technique based on statistical analysis. Previous attempts at translation had focused on encoding linguistic knowledge as rules for the computer to follow. But for this new approach, dubbed statistical machine translation (SMT), almost no linguistic knowledge was needed.

    Instead, the SMT system searched a large collection of texts, each of which was in two different languages, searching for statistical patterns that indicate the presence of words and phrases with the same meaning. Google and other translation companies use similar SMT systems today.

    推薦閱讀:

    世界上最有錢的國王去世了...他獨愛王后66年,深受泰國民眾愛戴,不料卻留下了一個狗血淋漓的皇室
    民眾幸福感調查問卷
    火星驚現「金字塔」圖像引發民眾臆測
    烏克蘭將鎮國之寶售華惹俄不滿 烏民眾:寧肯白送中國也不賣俄
    越南民眾回應「中國放水幫抗旱」:此舉充滿善意

    TAG:利比亞 | 民眾 |