Science and Technology Intelligence testing Who are you calling bird-brained?
科技 智力測試 叫誰是笨蛋呢?
An attempt is being made to devise a universal intelligence test
人類正嘗試設計一種通用的智力測驗
WHAT is the IQ of a chimpanzee? Or a worm? Or a game-show-winning computer program? Or even an alien from the planet Zaarg who can learn any human language in a day, can beat grandmasters ten at a time and can instantly factor the products of large prime numbers?
黑猩猩、蠕蟲或用于智力競賽的計算機程序,甚至是來自Zaarg星球的外星人(外星人能在一天內學會任何一門人類語言,每次都能贏棋類大師10分,能立即分解出大質數的因子),它們的智商是多少呢?
At the moment it is impossible to say.
當問到這類問題,沒有人能回答出來。
IQ tests depend on language, and even Watson, a computer program that beat two human contestants in a special edition of "Jeopardy!" (an American quiz show) on February 16th, does not have a perfect command of English.
IQ測試依賴于語言,而且就算是Watson系統對英語也不是了如指掌(2月16日,Watson系統在美國智力競賽"Jeopardy!"的特別節目中擊敗了兩名人類對手)。
In any case there is, at the moment, no meaningful scale on which non-human intelligence can be compared with the human sort.
但不管怎么樣,現在還沒有一個有意義的標尺能用于任何非人類和人類智商的比較。
The most famous test for artificial intelligence is that devised by Alan Turing, a British computing pioneer.
最著名的人工智力測試是由英國計算科學先驅阿蘭圖靈(Alan Turing)設計的。
To pass the Turing test, and thus be considered intelligent, a program must fool a human being into believing that it is another human being.
為了通過圖靈測試,機器中的程序必須使對方(人類)誤以為它也是一個人類,且如果機器通過了測試,該機器就被認為是人工智能。
But the Turing test still requires the program to share a language with the tester and, because it is all or nothing, cannot be used to rank different forms of artificial intelligence against one another.
但圖靈測試仍要求程序和測試者使用相同的語言。并且由于測試的結果是"是"或者"否",因此不能用于不同形式人工智能間的評級。
Jose Hernandez-Orallo of the Polytechnic University of Valencia, in Spain, and David Dowe of Monash University, in Australia, think they can do better than this. They believe not only that a universal scale of intelligence can be devised, but also that it can be assessed without reference to language.If they are right, an insult like "bird-brained" will, in the future, be finely calibrated.
來自西班牙瓦倫西亞理工大學的Jose Hernandez-Orallo和澳大利亞莫納什大學的David Dowe認為他們能設計出比圖靈更完善的測試。他們相信不僅能設計出可通用的智商標尺,而且測試可以與語言無關。如果真能如他們所說,那么將來就可以精確計算出"愚蠢"的程度了。
Dr Hernandez-Orallo and Dr Dowe, both computer scientists, propose to make their measurement by borrowing a concept called Kolmogorov complexity from information theory, a branch of computer science.
Hernandez-Orallo博士和Dr Dowe博士都是計算機科學家,他們提出:借用信息論(計算機科學的一個分支)中的Kolmogorov復雜度概念來實現測量。
The Kolmogorov complexity of a computer's output is the shortest possible program (measured in the binary digits that lie at the bottom of all computer code) that could produce that output.
計算機輸出的Kolmogorov復雜度是指能產生該輸出的可能最短程序(基于計算機代碼的二進制位數)。
On this measure, an entity's intelligence would be measured by the Kolmogorov complexity of the most complex tests it can solve—a clear, numerical value.
在這種設計下,可以通過最復雜測試的Kolmogorov復雜度來測量一個實體的智商。
In practice, calculating the true Kolmogorov complexity of a system is almost impossible.
實際上,要正確計算出系統的Kolmogorov復雜度幾乎是不可能的。
But an approximation can be made. And that, the researchers reckon, will be good enough.
但可以計算出它的近似值。而對研究人員的估算工作而言,近似值也就足夠了。
The actual tests would employ the well-honed methods of operant conditioning, developed initially on pigeons, in which the test subject has first to work out what is going on by trial and error.
實際的測試會采用精確的操作性條件反射方法,測試對象首先要清楚實驗的過程和錯誤。
As in operant conditioning, correct responses would be ewarded—by money, perhaps, for a human being; by bananas for a chimpanzee or by the numerical value itself for an appropriately programmed computer.
在操作性條件反射中,正確的反射響應將得到獎勵——對人而言可能是金錢,對黑猩猩而言是香蕉,對可編程計算機而言則可能是數值本身。
If the test were noughts and crosses, the test-taker (if it had never seen the game before) would first have to work out that the game is won by getting three in a row on a 3-by-3 grid, before actually going on to play.
如果測驗是畫圈叉游戲,則在進行游戲之前測試參與者(假定他們從沒聽說過圈叉游戲)首要要弄懂在3X3的網格中,3個格連成一行就贏得游戲了。
A chimpanzee might not manage a test of this level of complexity, but could, maybe, work out the idea of three-in-a-row when only a single row was involved.
黑猩猩可能沒辦法理解這種復雜度等級的測驗,但當只有一行時,它們也許就可以明白"3格-1行"的概念了。
Chess, though, would surely be beyond it (and probably beyond most people, too, if they did not already know the rules).
而象棋的難易度遠超過黑猩猩的理解程度(可能也超過大多數人的理解,如果他們并不知道象棋規則的話)。
Games like draughts and dominoes would lie somewhere in between.
西洋跳棋和多米諾骨牌這類游戲的難易程度則介于圈叉和象棋之間。
In fact Dr Hernandez-Orallo and Dr Dowe do not plan to use existing games. Instead they are employing a computer to enerate novel games and patterns.
實際上,Hernandez-Orallo博士和Dowe博士計劃采用計算機產生的新游戲和新花樣來代替已有的游戲。
Their approach eliminates human bias.
他們的方法消除了人類的偏見,
It also allows them to generate tests with any level of complexity they like—even ones that are far beyond the ability of humans to complete.
同時也允許產生任何復雜度等級的測驗——甚至可以是遠超過人類能力的。
When it comes to testing the tests, then, aliens from the planet Zaarg will be particularly welcome to apply.
當可以進行測試時,特別歡迎來自Zaarg星球的外星人。