ImageNet

ImageNet項目是一個大型視覺數據庫，用於視覺目標識別軟件研究。該項目已手動注釋了1400多萬張圖像^[1]^[2]，以指出圖片中的對象，並在至少100萬張圖像中提供了邊框^[3]。ImageNet包含2萬多個典型類別^[2]，例如「氣球」或「草莓」，每一類包含數百張圖像^[4]。儘管實際圖像不歸ImageNet所有，但可以直接從ImageNet免費獲得標註的第三方圖像URL^[5]。2010年以來，ImageNet項目每年舉辦一次軟件競賽，即ImageNet大規模視覺識別挑戰賽（ILSVRC）。挑戰賽使用1000個「整理」後的非重疊類^[6]，軟件程序比賽正確分類和檢測目標及場景。

歷史

AI研究員李飛飛從2006年開始研究ImageNet的想法。在大多數AI研究專注於模型和算法的時候，李飛飛則希望擴展和改進可用於訓練AI算法的數據^[7]。2007年，李飛飛與普林斯頓大學教授克里斯蒂安·費爾鮑姆（英語：Christiane Fellbaum）會面討論了該項目，他是WordNet的創建者之一。之後李繼續從WordNet的單詞數據庫開始構建ImageNet，並使用了其許多功能^[8]。作為普林斯頓大學的助理教授，李飛飛組建了一個研究團隊，致力於ImageNet項目。他們使用Amazon Mechanical Turk來幫助分類圖像^[8]。他們在2009年美國佛羅里達州舉行的計算機視覺與模式識別會議上首次以學術海報的形式展示了自己的數據庫^[8]^[9]^[10]。

ImageNet挑戰賽

ILSVRC旨在延續2005年起舉辦的較小規模的PASCAL VOC挑戰賽，後者僅包含約2萬張圖像和20個對象類別^[6]。為了使ImageNet「民主化」，李飛飛向PASCAL VOC團隊提出了一項合作，從2010年開始，研究團隊將在給定的數據集上評估他們的算法，並在幾項視覺識別任務上爭奪更高的準確率^[8]。由此產生的年度競賽現在稱為ImageNet大規模視覺識別挑戰賽（ILSVRC）。ILSVRC使用僅1000個「整理後的」圖像類別——例如完整的ImageNet類別中，狗的類別共有120種，而在「整理後的」圖像類別中，包括了120個犬種中的90個^[6]。

2010年代，圖像處理取得了巨大進步。2011年，良好的ILSVRC分類錯誤率為25%。2012年，AlexNet深層卷積神經網絡達到了15.3%的錯誤率，比第二名低10.8個百分點^[11]。在接下來的幾年中，錯誤率下降到百分之幾^[12]。儘管2012年的突破是「結合了之前有過的組件」，但大幅量化的改進標誌着全行業人工智能熱潮的開始^[4]。2015年，微軟的研究人員報告稱，他們的卷積神經網絡在ILSVRC任務中超過了人類水平，並贏得了當年的ImageNet挑戰賽^[13]^[14]。但是，正如挑戰賽的組織者之一奧爾加·盧薩科夫斯基（英語：Olga Russakovsky）在2015年指出的那樣，這些程序只需要識別出圖像屬於一千個類別中的哪一個即可，而人類可以識別更多類別，並且還可以判斷圖像的上下文^[15]。

到2014年，超過50家機構參加了ILSVRC^[6]。2015年，百度科學家因使用不同的帳號提交，大大超過了每周兩次的提交限制，而被禁止參加比賽一年^[16]^[17]。百度隨後表示已解僱相關團隊的負責人，並將建立一個科學顧問小組^[18]。

2017年，38個參賽團隊中有29個的錯誤率低於5%^[19]。2017年，ImageNet表示將在2018年推出一個新的、難度更大的挑戰賽，其中涉及使用自然語言對三維對象進行分類。由於創建三維數據比標註現有二維圖像的成本更高，因此預計數據集會更小。這方面的進展應用範圍從機器人導航到增強現實^[1]。

數據集

ImageNet通過眾包進行注釋。圖像級注釋表明圖像中是否存在目標類別，例如「此圖像中有老虎」或「此圖像中沒有老虎」。對象級注釋為對象（的可見部分）周圍提供了一個邊界框。ImageNet使用寬泛的WordNet模式的變體對目標進行分類，並增加了120個犬種類別，以顯示細粒度分類^[6]。2012年，ImageNet是全球最大的Mechanical Turk學術用戶，其雇用的普通工人每分鐘可以識別50張圖像^[2]。

ImageNet中的偏差

2019年對ImageNet和WordNet的多個層面（分類學，目標類別和標籤）的歷史進行的研究表明了用於各種圖像的大多數分類方法如何嵌入了偏見^[20]^[21]^[22]。ImageNet正在努力解決各種來源的偏見^[23]。

參見

參考資料

^ ^1.0 ^1.1 New computer vision challenge wants to teach robots to see in 3D. New Scientist. 2017-04-07 [2018-02-03]. （原始內容存檔於2018-10-30）.
^ ^2.0 ^2.1 ^2.2 Markoff, John. For Web Images, Creating New Technology to Seek and Find. The New York Times. 2012-11-19 [2018-02-03]. （原始內容存檔於2019-02-16）.
^ ImageNet Summary and Statistics. ImageNet. [2016-06-22]. （原始內容存檔於2019-03-20）.
^ ^4.0 ^4.1 From not working to neural networking. The Economist. 2016-06-25 [2018-02-03]. （原始內容存檔於2016-12-31）.
^ ImageNet Overview. ImageNet. [2016-06-22]. （原始內容存檔於2016-07-04）.
^ ^6.0 ^6.1 ^6.2 ^6.3 ^6.4 Olga Russakovsky*, Jia Deng*, Hao Su, Jonathan Krause, Sanjeev Satheesh, Sean Ma, Zhiheng Huang, Andrej Karpathy, Aditya Khosla, Michael Bernstein, Alexander C. Berg and Li Fei-Fei. (* = equal contribution) ImageNet Large Scale Visual Recognition Challenge. IJCV, 2015.
^ Hempel, Jesse. Fei-Fei Li's Quest to Make AI Better for Humanity. Wired. 2018-11-13 [2019-05-05]. （原始內容存檔於2018-12-06）. When Li, who had moved back to Princeton to take a job as an assistant professor in 2007, talked up her idea for ImageNet, she had a hard time getting faculty members to help out. Finally, a professor who specialized in computer architecture agreed to join her as a collaborator.
^ ^8.0 ^8.1 ^8.2 ^8.3 Gershgorn, Dave. The data that transformed AI research—and possibly the world. Quartz. Atlantic Media Co. 2017-07-26 [2017-07-26]. （原始內容存檔於2017-07-27）. Having read about WordNet's approach, Li met with professor Christiane Fellbaum, a researcher influential in the continued work on WordNet, during a 2006 visit to Princeton.
^ Deng, Jia; Dong, Wei; Socher, Richard; Li, Li-Jia; Li, Kai; Fei-Fei, Li, ImageNet: A Large-Scale Hierarchical Image Database (PDF), 2009 conference on Computer Vision and Pattern Recognition, 2009 [2020-01-15], （原始內容存檔 (PDF)於2021-01-15）
^ Li, Fei-Fei, How we're teaching computers to understand pictures, [2018-12-16], （原始內容存檔於2018-11-16）
^ Krizhevsky, Alex; Sutskever, Ilya; Hinton, Geoffrey E. ImageNet classification with deep convolutional neural networks (PDF). Communications of the ACM. June 2017, 60 (6): 84–90 [2017-05-24]. ISSN 0001-0782. doi:10.1145/3065386. （原始內容存檔 (PDF)於2017-05-16）.
^ Robbins, Martin. Does an AI need to make love to Rembrandt's girlfriend to make art?. The Guardian. 2016-05-06 [2016-06-22]. （原始內容存檔於2016-06-17）.
^ He, Kaiming; Zhang, Xiangyu; Ren, Shaoqing; Sun, Jian. Deep Residual Learning for Image Recognition.. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2016: 770–778. ISBN 978-1-4673-8851-1. arXiv:1512.03385 . doi:10.1109/CVPR.2016.90.
^ Markoff, John. A Learning Advance in Artificial Intelligence Rivals Human Abilities. The New York Times. 2015-12-10 [2016-06-22]. （原始內容存檔於2016-04-17）.
^ Aron, Jacob. Forget the Turing test – there are better ways of judging AI. New Scientist. 2015-09-21 [2016-06-22]. （原始內容存檔於2016-04-13）.
^ Markoff, John. Computer Scientists Are Astir After Baidu Team Is Barred From A.I. Competition. The New York Times. 2015-06-03 [2016-06-22]. （原始內容存檔於2016-05-23）.
^ Chinese search giant Baidu disqualified from AI test. BBC News. 2015-06-14 [2016-06-22]. （原始內容存檔於2016-08-17）.
^ Baidu fires researcher involved in AI contest flap. PC World. 2015-06-11 [2016-06-22]. （原始內容存檔於2016-08-28）.
^ Gershgorn, Dave. The Quartz guide to artificial intelligence: What is it, why is it important, and should we be afraid?. Quartz. 2017-09-10 [2018-02-03]. （原始內容存檔於2018-02-02）.
^ The Viral App That Labels You Isn't Quite What You Think. Wired. [2019-09-22]. ISSN 1059-1028. （原始內容存檔於2019-09-22）.
^ Wong, Julia Carrie. The viral selfie app ImageNet Roulette seemed fun – until it called me a racist slur. The Guardian. 2019-09-18 [2019-09-22]. ISSN 0261-3077. （原始內容存檔於2019-09-21）.
^ Crawford, Kate; Paglen, Trevor. Excavating AI: The Politics of Training Sets for Machine Learning. -. 2019-09-19 [2019-09-22]. （原始內容存檔於2019-09-21）.
^ Towards Fairer Datasets: Filtering and Balancing the Distribution of the People Subtree in the ImageNet Hierarchy. image-net.org. 2019-09-17 [2019-09-22]. （原始內容存檔於2019-09-22）.

外部連結

官方網站（英文）

[New_Scientist-1] 1.0 ^1.1 New computer vision challenge wants to teach robots to see in 3D. New Scientist. 2017-04-07 [2018-02-03]. （原始內容存檔於2018-10-30）.

[nytimes_2012-2] 2.0 ^2.1 ^2.2 Markoff, John. For Web Images, Creating New Technology to Seek and Find. The New York Times. 2012-11-19 [2018-02-03]. （原始內容存檔於2019-02-16）.

[3] ImageNet Summary and Statistics. ImageNet. [2016-06-22]. （原始內容存檔於2019-03-20）.

[economist-4] 4.0 ^4.1 From not working to neural networking. The Economist. 2016-06-25 [2018-02-03]. （原始內容存檔於2016-12-31）.

[5] ImageNet Overview. ImageNet. [2016-06-22]. （原始內容存檔於2016-07-04）.

[ILJVRC-2015-6] 6.0 ^6.1 ^6.2 ^6.3 ^6.4 Olga Russakovsky*, Jia Deng*, Hao Su, Jonathan Krause, Sanjeev Satheesh, Sean Ma, Zhiheng Huang, Andrej Karpathy, Aditya Khosla, Michael Bernstein, Alexander C. Berg and Li Fei-Fei. (* = equal contribution) ImageNet Large Scale Visual Recognition Challenge. IJCV, 2015.

[WiredQuest-7] Hempel, Jesse. Fei-Fei Li's Quest to Make AI Better for Humanity. Wired. 2018-11-13 [2019-05-05]. （原始內容存檔於2018-12-06）. When Li, who had moved back to Princeton to take a job as an assistant professor in 2007, talked up her idea for ImageNet, she had a hard time getting faculty members to help out. Finally, a professor who specialized in computer architecture agreed to join her as a collaborator.

[Gershgorn-8] 8.0 ^8.1 ^8.2 ^8.3 Gershgorn, Dave. The data that transformed AI research—and possibly the world. Quartz. Atlantic Media Co. 2017-07-26 [2017-07-26]. （原始內容存檔於2017-07-27）. Having read about WordNet's approach, Li met with professor Christiane Fellbaum, a researcher influential in the continued work on WordNet, during a 2006 visit to Princeton.

[9] Deng, Jia; Dong, Wei; Socher, Richard; Li, Li-Jia; Li, Kai; Fei-Fei, Li, ImageNet: A Large-Scale Hierarchical Image Database (PDF), 2009 conference on Computer Vision and Pattern Recognition, 2009 [2020-01-15], （原始內容存檔 (PDF)於2021-01-15）

[10] Li, Fei-Fei, How we're teaching computers to understand pictures, [2018-12-16], （原始內容存檔於2018-11-16）

[alexnet-11] Krizhevsky, Alex; Sutskever, Ilya; Hinton, Geoffrey E. ImageNet classification with deep convolutional neural networks (PDF). Communications of the ACM. June 2017, 60 (6): 84–90 [2017-05-24]. ISSN 0001-0782. doi:10.1145/3065386. （原始內容存檔 (PDF)於2017-05-16）.

[12] Robbins, Martin. Does an AI need to make love to Rembrandt's girlfriend to make art?. The Guardian. 2016-05-06 [2016-06-22]. （原始內容存檔於2016-06-17）.

[microsoft2015-13] He, Kaiming; Zhang, Xiangyu; Ren, Shaoqing; Sun, Jian. Deep Residual Learning for Image Recognition.. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2016: 770–778. ISBN 978-1-4673-8851-1. arXiv:1512.03385 . doi:10.1109/CVPR.2016.90.

[14] Markoff, John. A Learning Advance in Artificial Intelligence Rivals Human Abilities. The New York Times. 2015-12-10 [2016-06-22]. （原始內容存檔於2016-04-17）.

[15] Aron, Jacob. Forget the Turing test – there are better ways of judging AI. New Scientist. 2015-09-21 [2016-06-22]. （原始內容存檔於2016-04-13）.

[16] Markoff, John. Computer Scientists Are Astir After Baidu Team Is Barred From A.I. Competition. The New York Times. 2015-06-03 [2016-06-22]. （原始內容存檔於2016-05-23）.

[17] Chinese search giant Baidu disqualified from AI test. BBC News. 2015-06-14 [2016-06-22]. （原始內容存檔於2016-08-17）.

[18] Baidu fires researcher involved in AI contest flap. PC World. 2015-06-11 [2016-06-22]. （原始內容存檔於2016-08-28）.

[19] Gershgorn, Dave. The Quartz guide to artificial intelligence: What is it, why is it important, and should we be afraid?. Quartz. 2017-09-10 [2018-02-03]. （原始內容存檔於2018-02-02）.

[20] The Viral App That Labels You Isn't Quite What You Think. Wired. [2019-09-22]. ISSN 1059-1028. （原始內容存檔於2019-09-22）.

[21] Wong, Julia Carrie. The viral selfie app ImageNet Roulette seemed fun – until it called me a racist slur. The Guardian. 2019-09-18 [2019-09-22]. ISSN 0261-3077. （原始內容存檔於2019-09-21）.

[22] Crawford, Kate; Paglen, Trevor. Excavating AI: The Politics of Training Sets for Machine Learning. -. 2019-09-19 [2019-09-22]. （原始內容存檔於2019-09-21）.

[23] Towards Fairer Datasets: Filtering and Balancing the Distribution of the People Subtree in the ImageNet Hierarchy. image-net.org. 2019-09-17 [2019-09-22]. （原始內容存檔於2019-09-22）.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

[19]

[20]

[21]

[22]

[23]

閱論編標準測試項目
全字母句參考實現健全性測試標準測試圖像
人工智能	中文房間圖靈測試
電視（檢驗圖）	彩條信號印第安人頭檢驗圖測試卡F（英語：Test Card F）飛利浦PM5544
計算機語言	「你好，世界」程序自產生程式特拉百·帕爾多-克努斯算法（英語：Trabb Pardo–Knuth algorithm）編譯器遞歸測試 JAPH
數據壓縮	卡爾加里語料庫（英語：Calgary corpus）坎特伯雷語料庫（英語：Canterbury corpus）
三維計算機圖形	康奈爾盒子（英語：Cornell box）斯坦福兔子斯坦福龍（英語：Stanford dragon）猶他茶壺
機器學習	ImageNet MNIST數據庫列表（英語：List of datasets for machine learning research）
字體排印學	Hamburgevons（英語：Hamburgevons） Lorem ipsum The quick brown fox jumps over the lazy dog 我能吞下玻璃而不傷身體
其他	EICAR測試文件 GTUBE 哈佛語句（英語：Harvard sentences）萊娜圖〈Tom's Diner〉 SMPTE通用片頭（英語：film leader）圓圈星座防偽技術振動試驗（英語：Shakedown (testing)） Bad_Apple!!