更改

删除5,129字节 、 2020年5月14日 (四) 09:33
第208行: 第208行:  
=== Modern usage ===
 
=== Modern usage ===
   −
现代用法
+
'''现代用法'''
    
The modern conception of data science as an independent discipline is sometimes attributed to [[William S. Cleveland]].<ref>{{Cite web|url=https://www.stat.purdue.edu/~wsc/|title=William S Cleveland|last=Gupta|first=Shanti|date=December 11, 2015|website=|url-status=live|archive-url=|archive-date=|access-date=April 2, 2020}}</ref> In a 2001 paper, he advocated an expansion of statistics beyond theory into technical areas; because this would significantly change the field, it warranted a new name.<ref name=":1" /> "Data science" became more widely used in the next few years: in 2002, the [[Committee on Data for Science and Technology]] launched ''Data Science Journal.'' In 2003, Columbia University launched ''The Journal of Data Science''.<ref name=":1" /> In 2014, the [[American Statistical Association]]'s Section on Statistical Learning and Data Mining changed its name to the Section on Statistical Learning and Data Science, reflecting the ascendant popularity of data science.<ref>{{Cite news|last=Talley|first=Jill|url=https://magazine.amstat.org/blog/2016/06/01/datascience-2/|title=ASA Expands Scope, Outreach to Foster Growth, Collaboration in Data Science|date=June 1, 2016|work=Amstat News|access-date=|url-status=live|publisher=American Statistical Association}}</ref>
 
The modern conception of data science as an independent discipline is sometimes attributed to [[William S. Cleveland]].<ref>{{Cite web|url=https://www.stat.purdue.edu/~wsc/|title=William S Cleveland|last=Gupta|first=Shanti|date=December 11, 2015|website=|url-status=live|archive-url=|archive-date=|access-date=April 2, 2020}}</ref> In a 2001 paper, he advocated an expansion of statistics beyond theory into technical areas; because this would significantly change the field, it warranted a new name.<ref name=":1" /> "Data science" became more widely used in the next few years: in 2002, the [[Committee on Data for Science and Technology]] launched ''Data Science Journal.'' In 2003, Columbia University launched ''The Journal of Data Science''.<ref name=":1" /> In 2014, the [[American Statistical Association]]'s Section on Statistical Learning and Data Mining changed its name to the Section on Statistical Learning and Data Science, reflecting the ascendant popularity of data science.<ref>{{Cite news|last=Talley|first=Jill|url=https://magazine.amstat.org/blog/2016/06/01/datascience-2/|title=ASA Expands Scope, Outreach to Foster Growth, Collaboration in Data Science|date=June 1, 2016|work=Amstat News|access-date=|url-status=live|publisher=American Statistical Association}}</ref>
   −
 
+
数据科学作为一门独立学科的现代概念,一定程度上归功于威廉 · s · 克利夫兰(William S. Cleveland)。<ref>
 
  −
 
  −
The professional title of “data scientist” has been attributed to [[DJ Patil]] and [[Jeff Hammerbacher]] in 2008.<ref>{{Cite news|last=Davenport|first=Thomas H.|url=https://hbr.org/2012/10/data-scientist-the-sexiest-job-of-the-21st-century|title=Data Scientist: The Sexiest Job of the 21st Century|date=2012-10-01|work=Harvard Business Review|access-date=2020-04-03|last2=Patil|first2=D. J.|issue=October 2012|issn=0017-8012}}</ref> Though it was used by the [[National Science Board]] in their 2005 report, "Long-Lived Digital Data Collections: Enabling Research and Education in the 21st Century," it referred broadly to any key role in managing a digital data collection.<ref>{{Cite web|url=https://www.nsf.gov/pubs/2005/nsb0540/|title=US NSF - NSB-05-40, Long-Lived Digital Data Collections Enabling Research and Education in the 21st Century|website=www.nsf.gov|access-date=2020-04-03}}</ref>
  −
 
  −
 
  −
 
  −
There is still no consensus on the definition of data science and it is considered by some to be a buzzword.<ref>{{Cite web|url=https://www.forbes.com/sites/gilpress/2013/08/19/data-science-whats-the-half-life-of-a-buzzword/|title=Data Science: What's The Half-Life Of A Buzzword?|last=Press|first=Gil|website=Forbes|language=en|access-date=2020-04-03}}</ref>
  −
 
  −
 
  −
 
  −
 
  −
 
  −
 
  −
 
  −
数据科学作为一门独立学科的现代概念,一定程度上归功于威廉 · s · 克利夫兰 William S. Cleveland。<ref>
   
{{Cite web|url=https://www.stat.purdue.edu/~wsc/|title=William S Cleveland|last=Gupta|first=Shanti|date=December 11, 2015|website=|url-status=live|archive-url=|archive-date=|access-date=April 2, 2020}}
 
{{Cite web|url=https://www.stat.purdue.edu/~wsc/|title=William S Cleveland|last=Gupta|first=Shanti|date=December 11, 2015|website=|url-status=live|archive-url=|archive-date=|access-date=April 2, 2020}}
</ref>  
+
</ref> 在2001年的一篇论文中,他主张将统计学从理论扩展到技术领域;由于这将大大改变这个领域,因此它需要一个新的名称。在接下来的几年里,“数据科学”的应用更加广泛:2002年,数据科学与技术分会(CODATA)
在2001年,威廉·克利夫兰(William S.Cleveland)在他的文章《数据科学:一个用来扩大统计学领域技术范畴的行动计划》将数据科学作为一门独立学科引入,扩大了统计学的领域并使之包含“数据计算的前沿”,这篇文章发表在2001年4月版的《国际统计评论》(''International Statistical Review / Revue Internationale de Statistique'')的第69卷,第1篇上。
  −
<ref name="cleveland01">
  −
Cleveland, W. S. (2001).
  −
[https://pdfs.semanticscholar.org/915c/d8e2b39eb02723553913d592b2237d4d9960.pdf Data science: an action plan for expanding the technical areas of the field of statistics].
  −
International Statistical Review / Revue Internationale de Statistique, 21–26.
  −
</ref>
  −
在他的报告中,他主张将统计学从理论扩展到技术领域; 因为这将大大改变这个领域,它需要一个新的名称。克利夫兰建立了他认为数据科学所关心的6个技术领域:多学科调查,数据模型和方法,数据计算,教学方法、工具评估及其理论。
  −
 
  −
 
  −
在2002年4月,国际科学委员会(ICSU):数据科学与技术分会(CODATA)
   
<ref name="ics12">
 
<ref name="ics12">
 
International Council for Science : Committee on Data for Science and Technology. (2012, April).  
 
International Council for Science : Committee on Data for Science and Technology. (2012, April).  
 
CODATA, The Committee on Data for Science and Technology. Retrieved from International Council for Science : Committee on Data for Science and Technology: http://www.codata.org/
 
CODATA, The Committee on Data for Science and Technology. Retrieved from International Council for Science : Committee on Data for Science and Technology: http://www.codata.org/
 
</ref>
 
</ref>
创办了数据科学期刊(''Data Science Journal''
+
创办了数据科学期刊(''Data Science Journal'')。
 
<ref name="dsj12">
 
<ref name="dsj12">
 
Data Science Journal. (2012, April).  
 
Data Science Journal. (2012, April).  
 
Available Volumes.  
 
Available Volumes.  
 
Retrieved from Japan Science and Technology Information Aggregator, Electronic: http://www.jstage.jst.go.jp/browse/dsj/_vols
 
Retrieved from Japan Science and Technology Information Aggregator, Electronic: http://www.jstage.jst.go.jp/browse/dsj/_vols
</ref>
+
</ref>2014年,美国统计协会统计学习与数据挖掘部更名为统计学习与数据科学部,这反映了数据科学的日益普及。
,这是一份聚焦于诸如数据系统描述及其网络出版物、应用和法律问题的出版物
  −
<ref name="dsj02">
  −
Data Science Journal. (2002, April).
  −
Contents of Volume 1, Issue 1, April 2002.
  −
Retrieved from Japan Science and Technology Information Aggregator,
  −
Electronic: http://www.jstage.jst.go.jp/browse/dsj/1/0/_contents
  −
</ref>
  −
。之后不久,哥伦比亚大学在2003年1月开始出版数据科学期刊(''The Journal of Data Science'')
  −
<ref name="jds03">
  −
The Journal of Data Science. (2003, January).
  −
Contents of Volume 1, Issue 1, January 2003.
  −
Retrieved from http://www.jds-online.com/v1-1
  −
</ref>
  −
,为所有数据工作者提供了发表意见和交流想法的平台。这份期刊衷心致力于统计学方法应用和定量研究。在2005年,美国国家科学委员会出版了“长期数字数据收集:赋能21世纪的研究和教育”这一内容,注明数据科学家为“信息和计算机科学家、数据库和软件程序员、学科专家、管理者和注解专家、图书管理员、档案保管员,以及其它对数字化数据收集的成功管理起到关键性作用的人。”他们的首要活动是“进行创造性探究与分析。”
  −
<ref>
  −
National Science Board.
  −
[http://www.nsf.gov/pubs/2005/nsb0540/ Long-Lived Digital Data Collections Enabling Research and Education in the 21st Century]
  −
. National Science Foundation
  −
. Retrieved 30 June 2013.
  −
</ref>
        −
在2007年左右,
+
The professional title of “data scientist” has been attributed to [[DJ Patil]] and [[Jeff Hammerbacher]] in 2008.<ref>{{Cite news|last=Davenport|first=Thomas H.|url=https://hbr.org/2012/10/data-scientist-the-sexiest-job-of-the-21st-century|title=Data Scientist: The Sexiest Job of the 21st Century|date=2012-10-01|work=Harvard Business Review|access-date=2020-04-03|last2=Patil|first2=D. J.|issue=October 2012|issn=0017-8012}}</ref> Though it was used by the [[National Science Board]] in their 2005 report, "Long-Lived Digital Data Collections: Enabling Research and Education in the 21st Century," it referred broadly to any key role in managing a digital data collection.<ref>{{Cite web|url=https://www.nsf.gov/pubs/2005/nsb0540/|title=US NSF - NSB-05-40, Long-Lived Digital Data Collections Enabling Research and Education in the 21st Century|website=www.nsf.gov|access-date=2020-04-03}}</ref>
<ref>
  −
Citation needed
  −
</ref>
  −
图灵奖得主[https://en.wikipedia.org/wiki/Jim_Gray_(computer_scientist) 吉姆·格雷](Jim Gray)预见到使用大数据的分析计算作为主要科学方法的“数据驱动的科学”将成为科学的第四范式
  −
<ref name="TansleyTolle2009" />
  −
<ref name="BellHey2009" />
  −
,我们将迎来一个科学文献、科学数据全部链接且彼此运用的世界。
  −
<ref>
  −
Markoff,John(2009-12-14).
  −
[https://www.nytimes.com/2009/12/15/science/15books.html "Essays Inspired by Microsoft’s Jim Gray, Who Saw Science Paradigm Shift"].                     ''The New York Times''. Retrieved 2018-04-26.
  −
</ref>
      +
2008年,DJ Patil和Jeff Hammerbacher提出了“数据科学家”这个职称名词。<ref>
 +
{{Cite news|last=Davenport|first=Thomas H.|url=https://hbr.org/2012/10/data-scientist-the-sexiest-job-of-the-21st-century|title=Data Scientist: The Sexiest Job of the 21st Century|date=2012-10-01|work=Harvard Business Review|access-date=2020-04-03|last2=Patil|first2=D. J.|issue=October 2012|issn=0017-8012}}
 +
</ref>尽管美国国家科学委员会(National Science Board)在其2005年的报告《长期数字数据收集: 促进21世纪的研究和教育》(Long-Lived Digital Data Collections: Enabling Research and Education in the 21st Century)中使用了这个词,但它泛指在数字化数据收集管理中起关键性作用的人。<ref>{{Cite web|url=https://www.nsf.gov/pubs/2005/nsb0540/|title=US NSF - NSB-05-40, Long-Lived Digital Data Collections Enabling Research and Education in the 21st Century|website=www.nsf.gov|access-date=2020-04-03}}</ref>
   −
2008年,DJ 帕蒂尔 DJ Patil和杰夫哈默巴赫尔 Jeff Hammerbacher被授予“数据科学家”的职称。<ref>
  −
{{Cite news|last=Davenport|first=Thomas H.|url=https://hbr.org/2012/10/data-scientist-the-sexiest-job-of-the-21st-century|title=Data Scientist: The Sexiest Job of the 21st Century|date=2012-10-01|work=Harvard Business Review|access-date=2020-04-03|last2=Patil|first2=D. J.|issue=October 2012|issn=0017-8012}}
  −
</ref>尽管美国国家科学委员会(National Science Board)在其2005年的报告《长期数字数据收集: 21世纪的研究和教育成果》(Long-Lived Digital Data Collections: Enabling Research and Education in the 21st Century)中使用了这个词,但它泛指管理数字数据收集的任何关键角色。<ref>{{Cite web|url=https://www.nsf.gov/pubs/2005/nsb0540/|title=US NSF - NSB-05-40, Long-Lived Digital Data Collections Enabling Research and Education in the 21st Century|website=www.nsf.gov|access-date=2020-04-03}}</ref>
      +
There is still no consensus on the definition of data science and it is considered by some to be a buzzword.<ref>{{Cite web|url=https://www.forbes.com/sites/gilpress/2013/08/19/data-science-whats-the-half-life-of-a-buzzword/|title=Data Science: What's The Half-Life Of A Buzzword?|last=Press|first=Gil|website=Forbes|language=en|access-date=2020-04-03}}</ref>
   −
在2012年[https://en.wikipedia.org/wiki/Harvard_Business_Review 《哈佛商业评论》]的报道“数据科学家:21世纪最富有魅力的工作”中
+
对于数据科学的定义还没有达成共识,有些人认为这是一个时髦的词。
<ref name="Harvard">
  −
{{Cite journal
  −
|url=https://hbr.org/2012/10/data-scientist-the-sexiest-job-of-the-21st-century/
  −
|title=Data Scientist: The Sexiest Job of the 21st Century
  −
|publisher=Harvard Business Review
  −
|first=Thomas H.
  −
|last=Davenport
  −
|first2=DJ
  −
|last2=Patil
  −
|date=Oct 2012
  −
}}</ref>
  −
,[https://en.wikipedia.org/wiki/DJ_Patil 帕蒂尔](DJ Patil)声称其已于2008年和[https://en.wikipedia.org/wiki/Jeff_Hammerbacher 杰弗·哈梅巴赫](Jeff Hammerbacher)共同创造了这一术语,用以标注他们在领英和脸书上的职业信息。他断言数据科学家将是一种全新的职业类型,并且数据科学家的短缺正成为某些领域的严重掣肘,但同时也将其描述为一个更加商业化导向的角色。
        −
2013年,IEEE数据科学和高等分析专门工作组
+
帕蒂尔](DJ Patil)声称其已于2008年和[https://en.wikipedia.org/wiki/Jeff_Hammerbacher 杰弗·哈梅巴赫](Jeff Hammerbacher)共同创造了这一术语
<ref>
+
统计学习和数据挖掘部门将其期刊更名为“统计分析与数据挖掘:ASA数据科学期刊”,并在2016年将其部门更名为“统计学习与数据科学”。
[http://www.dsaa.co "IEEE Task Force on Data Science and Advanced Analytics"]
  −
</ref>
  −
成立,同年第一届“欧洲数据分析大会(ECDA)”在卢森堡召开,会上成立了[http://euads.org/ 欧洲数据科学协会](EuADS)。第一届国际会议——IEEE国际数据科学和高等分析会议也于2014年召开。
  −
<ref>
  −
[http://datamining.it.uts.edu.au/conferences/dsaa14/ "2014 IEEE International Conference on Data Science and Advanced Analytics"]
  −
</ref>
  −
同年,编程训练营始祖[https://en.wikipedia.org/wiki/General_Assembly_(school) General Assembly]开启了学生付费培训,[https://en.wikipedia.org/wiki/The_Data_Incubator 数据孵化器公司]成立了一个富有竞争力的自由数据科学团体。
  −
<ref>
  −
[https://venturebeat.com/2014/04/15/ny-gets-new-bootcamp-for-data-scientists-its-free-but-harder-to-get-into-than-harvard/ "NY gets new bootcamp for data scientists: It’s free, but harder to get into than Harvard "]. 
  −
''Venture Beat'' Retrieved 2016-02-22.
  −
</ref>
  −
也是在2014年,[https://en.wikipedia.org/wiki/American_Statistical_Association 美国统计协会]的统计学习和数据挖掘部门将其期刊更名为“统计分析与数据挖掘:ASA数据科学期刊”,并在2016年将其部门更名为“统计学习与数据科学”。
   
<ref name="ASA">
 
<ref name="ASA">
 
Talley,Jill(2016-06-01)
 
Talley,Jill(2016-06-01)
第329行: 第250行:       −
2015年,Springer创办国际数据科学与分析杂志
  −
<ref>
  −
[https://www.springer.com/41060 "Journal on Data Science and Analytics"]
  −
</ref>
  −
,用来出版有关数据科学和大数据分析方面的原创性工作。2015年9月,[http://www.gfkl.org/welcome/ GfKI]在英国克彻斯特的[https://en.wikipedia.org/wiki/University_of_Essex 埃塞克斯大学]举办的第三届ECDA大会上增设“数据科学社团”。
       
198

个编辑