更改

添加1,048字节 、 2020年5月10日 (日) 20:20
第198行: 第198行:     
In 1962, John Tukey described a field he called “data analysis,” which resembles modern data science. Later, attendees at a 1992 statistics symposium at the University of Montpellier II acknowledged the emergence of a new discipline focused on data of various origins and forms, combining established concepts and principles of statistics and data analysis with computing.   
 
In 1962, John Tukey described a field he called “data analysis,” which resembles modern data science. Later, attendees at a 1992 statistics symposium at the University of Montpellier II acknowledged the emergence of a new discipline focused on data of various origins and forms, combining established concepts and principles of statistics and data analysis with computing.   
  −
1962年,John Tukey 描述了一个他称之为“数据分析”的领域,类似于现代的数据科学。<ref name=":7">
  −
{{Cite web|url=http://courses.csail.mit.edu/18.337/2015/docs/50YearsDataScience.pdf|title=50 years of Data Science|last=Donoho|first=David|date=September 18, 2015|website=|url-status=live|archive-url=|archive-date=|access-date=April 2, 2020}}
  −
</ref> 在之后的阶段,参加1992年第二届蒙彼利埃大学统计研讨会的与会者承认了一个新的学科的出现,这个学科专注于各种缘起和形式的数据,将统计和数据分析的既定概念和原则与计算结合起来。<ref>{{Cite book|title=Data science and its applications = La @science des données et ses applications|date=1995|publisher=Academic Press/Harcourt Brace|others=Escoufier, Yves., Hayashi, Chikio (1918-....)., Fichet, Bernard.|year=1995|isbn=0-12-241770-4|location=Tokyo|pages=|oclc=489990740}}</ref><ref>{{Cite journal|last=Murtagh|first=Fionn|last2=Devlin|first2=Keith|date=2018|title=The Development of Data Science: Implications for Education, Employment, Research, and the Data Revolution for Sustainable Development|url=https://www.mdpi.com/2504-2289/2/2/14|journal=Big Data and Cognitive Computing|language=en|volume=2|issue=2|pages=14|doi=10.3390/bdcc2020014|via=|doi-access=free}}</ref> 
  −
  −
  −
        第211行: 第204行:     
The term “data science” has been traced back to 1974, when Peter Naur proposed it as an alternative name for computer science. In 1996, the International Federation of Classification Societies became the first conference to specifically feature data science as a topic. In 1998, Chikio Hayashi argued for data science as a new, interdisciplinary concept, with three aspects: data design, collection, and analysis.   
 
The term “data science” has been traced back to 1974, when Peter Naur proposed it as an alternative name for computer science. In 1996, the International Federation of Classification Societies became the first conference to specifically feature data science as a topic. In 1998, Chikio Hayashi argued for data science as a new, interdisciplinary concept, with three aspects: data design, collection, and analysis.   
  −
“数据科学”这一术语可以追溯到1974年,由彼得·诺尔 Peter Naur 提出作为计算机科学的替代名称。.<ref name=":0">
  −
{{Cite journal|last=CaoLongbing|date=2017-06-29|title=Data Science|journal=ACM Computing Surveys (CSUR)|volume=50|issue=3|pages=1–42|language=EN|doi=10.1145/3076253|doi-access=free}}
  −
</ref> 1996年,国际船级社联合会 the International Federation of Classification Societies成为第一个以数据科学为专题的会议。<ref>
  −
{{Cite web|url=http://www2.isye.gatech.edu/~jeffwu/presentations/datascience.pdf|title=Statistics=Data Science?|last=Wu|first=C.F. Jeff|date=|website=|url-status=live|archive-url=|archive-date=|access-date=April 2, 2020}}
  −
</ref> 1998年,林志雄主张数据科学是一个新的、跨学科的概念,包括数据设计、数据收集和数据分析三个方面。<ref>
  −
{{Cite journal|last=Murtagh|first=Fionn|last2=Devlin|first2=Keith|date=2018|title=The Development of Data Science: Implications for Education, Employment, Research, and the Data Revolution for Sustainable Development|url=https://www.mdpi.com/2504-2289/2/2/14|journal=Big Data and Cognitive Computing|language=en|volume=2|issue=2|pages=14|doi=10.3390/bdcc2020014|via=|doi-access=free}}</ref> 
        第226行: 第212行:     
During the 1990s, popular terms for the process of finding patterns in datasets (which were increasingly large) included “knowledge discovery” and “data mining.”   
 
During the 1990s, popular terms for the process of finding patterns in datasets (which were increasingly large) included “knowledge discovery” and “data mining.”   
  −
在20世纪90年代,在数据集中寻找模式的流行术语(数据集越来越大)包括“知识发现”和“数据挖掘”<ref name=":1">{{Cite web|url=https://www.forbes.com/sites/gilpress/2013/05/28/a-very-short-history-of-data-science/|title=A Very Short History Of Data Science|last=Press|first=Gil|website=Forbes|language=en|access-date=2020-04-03}}</ref><ref name=":0" /> 
  −
  −
  −
  --[[用户:趣木木|趣木木]]([[用户讨论:趣木木|讨论]])下为旧版关于数据科学的词源演变由来的部分内容  可参考整合并进行填充
      
“数据科学”这一术语在过去的三十年里已经出现在各种语境中,但直到最近才成为一个确定的术语。在早期,1960年它被[https://en.wikipedia.org/wiki/Peter_Naur 彼得·诺尔](Peter Naur)用作[https://en.wikipedia.org/wiki/Computer_science 计算机科学]的代名词。诺尔后来引入了[https://en.wikipedia.org/wiki/Datalogy “数据学”](datalogy)这一术语。
 
“数据科学”这一术语在过去的三十年里已经出现在各种语境中,但直到最近才成为一个确定的术语。在早期,1960年它被[https://en.wikipedia.org/wiki/Peter_Naur 彼得·诺尔](Peter Naur)用作[https://en.wikipedia.org/wiki/Computer_science 计算机科学]的代名词。诺尔后来引入了[https://en.wikipedia.org/wiki/Datalogy “数据学”](datalogy)这一术语。
第245行: 第226行:  
|doi:10.1145/365719.366510
 
|doi:10.1145/365719.366510
 
}}</ref>
 
}}</ref>
在1974年,诺尔出版了《计算机方法简明调查》,在这本书对同时代被广泛应用的数据处理方法的调查中,他很自然地使用了“数据科学”这一术语。
+
 
 +
 
 +
1962年,John Tukey 描述了一个他称之为“数据分析”的领域,类似于现代的数据科学。<ref name=":7">
 +
{{Cite web|url=http://courses.csail.mit.edu/18.337/2015/docs/50YearsDataScience.pdf|title=50 years of Data Science|last=Donoho|first=David|date=September 18, 2015|website=|url-status=live|archive-url=|archive-date=|access-date=April 2, 2020}}
 +
</ref> 在之后的阶段,参加1992年第二届蒙彼利埃大学统计研讨会的与会者承认了一个新的学科的出现,这个学科专注于各种缘起和形式的数据,将统计和数据分析的既定概念和原则与计算结合起来。<ref>{{Cite book|title=Data science and its applications = La @science des données et ses applications|date=1995|publisher=Academic Press/Harcourt Brace|others=Escoufier, Yves., Hayashi, Chikio (1918-....)., Fichet, Bernard.|year=1995|isbn=0-12-241770-4|location=Tokyo|pages=|oclc=489990740}}</ref><ref>{{Cite journal|last=Murtagh|first=Fionn|last2=Devlin|first2=Keith|date=2018|title=The Development of Data Science: Implications for Education, Employment, Research, and the Data Revolution for Sustainable Development|url=https://www.mdpi.com/2504-2289/2/2/14|journal=Big Data and Cognitive Computing|language=en|volume=2|issue=2|pages=14|doi=10.3390/bdcc2020014|via=|doi-access=free}}</ref> 
 +
 
 +
 
 +
“数据科学”这一术语可以追溯到1974年,在1974年,彼得·诺尔 Peter Naur出版了《计算机方法简明调查》,在这本书对同时代被广泛应用的数据处理方法的调查中,他很自然地使用了“数据科学”这一术语,作为计算机科学的替代名称。.<ref name=":0">
 +
{{Cite journal|last=CaoLongbing|date=2017-06-29|title=Data Science|journal=ACM Computing Surveys (CSUR)|volume=50|issue=3|pages=1–42|language=EN|doi=10.1145/3076253|doi-access=free}}
 +
</ref> 1996年,国际船级社联合会 the International Federation of Classification Societies成为第一个以数据科学为专题的会议。<ref>
 +
{{Cite web|url=http://www2.isye.gatech.edu/~jeffwu/presentations/datascience.pdf|title=Statistics=Data Science?|last=Wu|first=C.F. Jeff|date=|website=|url-status=live|archive-url=|archive-date=|access-date=April 2, 2020}}
 +
</ref>
 +
 
    
在1996年,国际分级社团联盟 (IFCS)的成员在日本神户举行了两年一次的会议,在此,术语“数据科学”在由林知己夫(Chikio Hayashi)
 
在1996年,国际分级社团联盟 (IFCS)的成员在日本神户举行了两年一次的会议,在此,术语“数据科学”在由林知己夫(Chikio Hayashi)
第295行: 第288行:  
</ref>
 
</ref>
 
,在演讲中他将统计学工作描述为数据收集、建模和分析、决策的三个流程。在结论中他首创了现代的、非计算机科学内的“数据科学”术语用法,并提倡统计学应更名为数据科学,统计学家应被称作为数据科学家。
 
,在演讲中他将统计学工作描述为数据收集、建模和分析、决策的三个流程。在结论中他首创了现代的、非计算机科学内的“数据科学”术语用法,并提倡统计学应更名为数据科学,统计学家应被称作为数据科学家。
<ref name="cfjwutk"/>
+
<ref name="cfjwutk"/>之后,他又在1998年纪念印度科学家和统计学家、[https://en.wikipedia.org/wiki/Indian_Statistical_Institute 印度统计学院]创立者[https://en.wikipedia.org/wiki/Prasanta_Chandra_Mahalanobis 马哈拉诺比斯(P.C. Mahalanobis)]的讲座上将同名的演讲作为其系列演讲
之后,他又在1998年纪念印度科学家和统计学家、[https://en.wikipedia.org/wiki/Indian_Statistical_Institute 印度统计学院]创立者[https://en.wikipedia.org/wiki/Prasanta_Chandra_Mahalanobis 马哈拉诺比斯(P.C. Mahalanobis)]的讲座上将同名的演讲作为其系列演讲
   
<ref name="cfjwu02">
 
<ref name="cfjwu02">
 
[http://www.isical.ac.in/~statmath/html/pcm/pcm_recent.html "P.C. Mahalanobis Memorial Lectures, 7th series"].  
 
[http://www.isical.ac.in/~statmath/html/pcm/pcm_recent.html "P.C. Mahalanobis Memorial Lectures, 7th series"].  
第305行: 第297行:  
的第一篇进而发表。
 
的第一篇进而发表。
    +
 +
1998年,林志雄主张数据科学是一个新的、跨学科的概念,包括数据设计、数据收集和数据分析三个方面。<ref>
 +
{{Cite journal|last=Murtagh|first=Fionn|last2=Devlin|first2=Keith|date=2018|title=The Development of Data Science: Implications for Education, Employment, Research, and the Data Revolution for Sustainable Development|url=https://www.mdpi.com/2504-2289/2/2/14|journal=Big Data and Cognitive Computing|language=en|volume=2|issue=2|pages=14|doi=10.3390/bdcc2020014|via=|doi-access=free}}</ref> 
 +
 +
 +
 +
在20世纪90年代,在数据集中寻找模式的流行术语(数据集越来越大)包括“知识发现”和“数据挖掘”<ref name=":1">{{Cite web|url=https://www.forbes.com/sites/gilpress/2013/05/28/a-very-short-history-of-data-science/|title=A Very Short History Of Data Science|last=Press|first=Gil|website=Forbes|language=en|access-date=2020-04-03}}</ref><ref name=":0" /> 
 +
 +
 +
  --[[用户:趣木木|趣木木]]([[用户讨论:趣木木|讨论]])下为旧版关于数据科学的词源演变由来的部分内容  可参考整合并进行填充
 +
 +
 +
 +
 +
=== Modern usage ===
 +
 +
现代用法
 +
 +
The modern conception of data science as an independent discipline is sometimes attributed to [[William S. Cleveland]].<ref>{{Cite web|url=https://www.stat.purdue.edu/~wsc/|title=William S Cleveland|last=Gupta|first=Shanti|date=December 11, 2015|website=|url-status=live|archive-url=|archive-date=|access-date=April 2, 2020}}</ref> In a 2001 paper, he advocated an expansion of statistics beyond theory into technical areas; because this would significantly change the field, it warranted a new name.<ref name=":1" /> "Data science" became more widely used in the next few years: in 2002, the [[Committee on Data for Science and Technology]] launched ''Data Science Journal.'' In 2003, Columbia University launched ''The Journal of Data Science''.<ref name=":1" /> In 2014, the [[American Statistical Association]]'s Section on Statistical Learning and Data Mining changed its name to the Section on Statistical Learning and Data Science, reflecting the ascendant popularity of data science.<ref>{{Cite news|last=Talley|first=Jill|url=https://magazine.amstat.org/blog/2016/06/01/datascience-2/|title=ASA Expands Scope, Outreach to Foster Growth, Collaboration in Data Science|date=June 1, 2016|work=Amstat News|access-date=|url-status=live|publisher=American Statistical Association}}</ref>
 +
 +
The modern conception of data science as an independent discipline is sometimes attributed to William S. Cleveland. In a 2001 paper, he advocated an expansion of statistics beyond theory into technical areas; because this would significantly change the field, it warranted a new name.
 +
 +
 +
 +
The professional title of “data scientist” has been attributed to [[DJ Patil]] and [[Jeff Hammerbacher]] in 2008.<ref>{{Cite news|last=Davenport|first=Thomas H.|url=https://hbr.org/2012/10/data-scientist-the-sexiest-job-of-the-21st-century|title=Data Scientist: The Sexiest Job of the 21st Century|date=2012-10-01|work=Harvard Business Review|access-date=2020-04-03|last2=Patil|first2=D. J.|issue=October 2012|issn=0017-8012}}</ref> Though it was used by the [[National Science Board]] in their 2005 report, "Long-Lived Digital Data Collections: Enabling Research and Education in the 21st Century," it referred broadly to any key role in managing a digital data collection.<ref>{{Cite web|url=https://www.nsf.gov/pubs/2005/nsb0540/|title=US NSF - NSB-05-40, Long-Lived Digital Data Collections Enabling Research and Education in the 21st Century|website=www.nsf.gov|access-date=2020-04-03}}</ref>
 +
 +
The professional title of “data scientist” has been attributed to DJ Patil and Jeff Hammerbacher in 2008. Though it was used by the National Science Board in their 2005 report, "Long-Lived Digital Data Collections: Enabling Research and Education in the 21st Century," it referred broadly to any key role in managing a digital data collection.
 +
 +
 +
There is still no consensus on the definition of data science and it is considered by some to be a buzzword.<ref>{{Cite web|url=https://www.forbes.com/sites/gilpress/2013/08/19/data-science-whats-the-half-life-of-a-buzzword/|title=Data Science: What's The Half-Life Of A Buzzword?|last=Press|first=Gil|website=Forbes|language=en|access-date=2020-04-03}}</ref>
 +
 +
There is still no consensus on the definition of data science and it is considered by some to be a buzzword.
 +
 +
 +
 +
 +
 +
数据科学作为一门独立学科的现代概念,一定程度上归功于威廉 · s · 克利夫兰 William S. Cleveland。<ref>
 +
{{Cite web|url=https://www.stat.purdue.edu/~wsc/|title=William S Cleveland|last=Gupta|first=Shanti|date=December 11, 2015|website=|url-status=live|archive-url=|archive-date=|access-date=April 2, 2020}}
 +
</ref>
 
在2001年,威廉·克利夫兰(William S.Cleveland)在他的文章《数据科学:一个用来扩大统计学领域技术范畴的行动计划》将数据科学作为一门独立学科引入,扩大了统计学的领域并使之包含“数据计算的前沿”,这篇文章发表在2001年4月版的《国际统计评论》(''International Statistical Review / Revue Internationale de Statistique'')的第69卷,第1篇上。
 
在2001年,威廉·克利夫兰(William S.Cleveland)在他的文章《数据科学:一个用来扩大统计学领域技术范畴的行动计划》将数据科学作为一门独立学科引入,扩大了统计学的领域并使之包含“数据计算的前沿”,这篇文章发表在2001年4月版的《国际统计评论》(''International Statistical Review / Revue Internationale de Statistique'')的第69卷,第1篇上。
 
<ref name="cleveland01">
 
<ref name="cleveland01">
第311行: 第343行:  
International Statistical Review / Revue Internationale de Statistique, 21–26.
 
International Statistical Review / Revue Internationale de Statistique, 21–26.
 
</ref>  
 
</ref>  
在他的报告中,克利夫兰建立了他认为数据科学所关心的6个技术领域:多学科调查,数据模型和方法,数据计算,教学方法、工具评估及其理论。
+
在他的报告中,他主张将统计学从理论扩展到技术领域; 因为这将大大改变这个领域,它需要一个新的名称。克利夫兰建立了他认为数据科学所关心的6个技术领域:多学科调查,数据模型和方法,数据计算,教学方法、工具评估及其理论。
    
在2002年4月,国际科学委员会(ICSU):数据科学与技术分会(CODATA)
 
在2002年4月,国际科学委员会(ICSU):数据科学与技术分会(CODATA)
第344行: 第376行:  
. Retrieved 30 June 2013.
 
. Retrieved 30 June 2013.
 
</ref>
 
</ref>
 +
 +
    
在2007年左右,
 
在2007年左右,
第357行: 第391行:  
[https://www.nytimes.com/2009/12/15/science/15books.html "Essays Inspired by Microsoft’s Jim Gray, Who Saw Science Paradigm Shift"].                      ''The New York Times''. Retrieved 2018-04-26.
 
[https://www.nytimes.com/2009/12/15/science/15books.html "Essays Inspired by Microsoft’s Jim Gray, Who Saw Science Paradigm Shift"].                      ''The New York Times''. Retrieved 2018-04-26.
 
</ref>
 
</ref>
 +
 +
 +
2008年,DJ 帕蒂尔 DJ Patil和杰夫哈默巴赫尔 Jeff Hammerbacher被授予“数据科学家”的职称。<ref>
 +
{{Cite news|last=Davenport|first=Thomas H.|url=https://hbr.org/2012/10/data-scientist-the-sexiest-job-of-the-21st-century|title=Data Scientist: The Sexiest Job of the 21st Century|date=2012-10-01|work=Harvard Business Review|access-date=2020-04-03|last2=Patil|first2=D. J.|issue=October 2012|issn=0017-8012}}
 +
</ref>尽管美国国家科学委员会(National Science Board)在其2005年的报告《长期数字数据收集: 21世纪的研究和教育成果》(Long-Lived Digital Data Collections: Enabling Research and Education in the 21st Century)中使用了这个词,但它泛指管理数字数据收集的任何关键角色。<ref>{{Cite web|url=https://www.nsf.gov/pubs/2005/nsb0540/|title=US NSF - NSB-05-40, Long-Lived Digital Data Collections Enabling Research and Education in the 21st Century|website=www.nsf.gov|access-date=2020-04-03}}</ref>
 +
 +
    
在2012年[https://en.wikipedia.org/wiki/Harvard_Business_Review 《哈佛商业评论》]的报道“数据科学家:21世纪最富有魅力的工作”中
 
在2012年[https://en.wikipedia.org/wiki/Harvard_Business_Review 《哈佛商业评论》]的报道“数据科学家:21世纪最富有魅力的工作”中
第399行: 第440行:  
,用来出版有关数据科学和大数据分析方面的原创性工作。2015年9月,[http://www.gfkl.org/welcome/ GfKI]在英国克彻斯特的[https://en.wikipedia.org/wiki/University_of_Essex 埃塞克斯大学]举办的第三届ECDA大会上增设“数据科学社团”。
 
,用来出版有关数据科学和大数据分析方面的原创性工作。2015年9月,[http://www.gfkl.org/welcome/ GfKI]在英国克彻斯特的[https://en.wikipedia.org/wiki/University_of_Essex 埃塞克斯大学]举办的第三届ECDA大会上增设“数据科学社团”。
   −
=== Modern usage ===
  −
  −
现代用法
  −
  −
The modern conception of data science as an independent discipline is sometimes attributed to [[William S. Cleveland]].<ref>{{Cite web|url=https://www.stat.purdue.edu/~wsc/|title=William S Cleveland|last=Gupta|first=Shanti|date=December 11, 2015|website=|url-status=live|archive-url=|archive-date=|access-date=April 2, 2020}}</ref> In a 2001 paper, he advocated an expansion of statistics beyond theory into technical areas; because this would significantly change the field, it warranted a new name.<ref name=":1" /> "Data science" became more widely used in the next few years: in 2002, the [[Committee on Data for Science and Technology]] launched ''Data Science Journal.'' In 2003, Columbia University launched ''The Journal of Data Science''.<ref name=":1" /> In 2014, the [[American Statistical Association]]'s Section on Statistical Learning and Data Mining changed its name to the Section on Statistical Learning and Data Science, reflecting the ascendant popularity of data science.<ref>{{Cite news|last=Talley|first=Jill|url=https://magazine.amstat.org/blog/2016/06/01/datascience-2/|title=ASA Expands Scope, Outreach to Foster Growth, Collaboration in Data Science|date=June 1, 2016|work=Amstat News|access-date=|url-status=live|publisher=American Statistical Association}}</ref>
  −
  −
The modern conception of data science as an independent discipline is sometimes attributed to William S. Cleveland. In a 2001 paper, he advocated an expansion of statistics beyond theory into technical areas; because this would significantly change the field, it warranted a new name.
  −
  −
数据科学作为一门独立学科的现代概念,有时归功于威廉 · s · 克利夫兰。在2001年的一篇论文中,他主张将统计学从理论扩展到技术领域; 因为这将大大改变这个领域,它需要一个新的名称。
  −
  −
  −
  −
  −
  −
The professional title of “data scientist” has been attributed to [[DJ Patil]] and [[Jeff Hammerbacher]] in 2008.<ref>{{Cite news|last=Davenport|first=Thomas H.|url=https://hbr.org/2012/10/data-scientist-the-sexiest-job-of-the-21st-century|title=Data Scientist: The Sexiest Job of the 21st Century|date=2012-10-01|work=Harvard Business Review|access-date=2020-04-03|last2=Patil|first2=D. J.|issue=October 2012|issn=0017-8012}}</ref> Though it was used by the [[National Science Board]] in their 2005 report, "Long-Lived Digital Data Collections: Enabling Research and Education in the 21st Century," it referred broadly to any key role in managing a digital data collection.<ref>{{Cite web|url=https://www.nsf.gov/pubs/2005/nsb0540/|title=US NSF - NSB-05-40, Long-Lived Digital Data Collections Enabling Research and Education in the 21st Century|website=www.nsf.gov|access-date=2020-04-03}}</ref>
  −
  −
The professional title of “data scientist” has been attributed to DJ Patil and Jeff Hammerbacher in 2008. Though it was used by the National Science Board in their 2005 report, "Long-Lived Digital Data Collections: Enabling Research and Education in the 21st Century," it referred broadly to any key role in managing a digital data collection.
  −
  −
2008年,DJ 帕蒂尔和杰夫哈默巴赫尔被授予“数据科学家”的职称。尽管美国国家科学委员会(National Science Board)在其2005年的报告《长期数字数据收集: 21世纪的研究和教育成果》(Long-Lived Digital Data Collections: Enabling Research and Education in the 21st Century)中使用了这个词,但它广泛地提到了管理数字数。
  −
  −
  −
  −
  −
  −
There is still no consensus on the definition of data science and it is considered by some to be a buzzword.<ref>{{Cite web|url=https://www.forbes.com/sites/gilpress/2013/08/19/data-science-whats-the-half-life-of-a-buzzword/|title=Data Science: What's The Half-Life Of A Buzzword?|last=Press|first=Gil|website=Forbes|language=en|access-date=2020-04-03}}</ref>
  −
  −
There is still no consensus on the definition of data science and it is considered by some to be a buzzword.
     −
对于数据科学的定义还没有达成共识,有些人认为这是一个流行词。
+
对于数据科学的定义还没有达成共识,有些人认为这是一个流行词。<ref>{{Cite web|url=https://www.forbes.com/sites/gilpress/2013/08/19/data-science-whats-the-half-life-of-a-buzzword/|title=Data Science: What's The Half-Life Of A Buzzword?|last=Press|first=Gil|website=Forbes|language=en|access-date=2020-04-03}}</ref>
    
==研究内容==
 
==研究内容==
763

个编辑