Line 208: |
Line 208: |
| === Modern usage === | | === Modern usage === |
| | | |
− | Modern usage | + | '''Modern usage''' |
| | | |
| The modern conception of data science as an independent discipline is sometimes attributed to [[William S. Cleveland]].<ref>{{Cite web|url=https://www.stat.purdue.edu/~wsc/|title=William S Cleveland|last=Gupta|first=Shanti|date=December 11, 2015|website=|url-status=live|archive-url=|archive-date=|access-date=April 2, 2020}}</ref> In a 2001 paper, he advocated an expansion of statistics beyond theory into technical areas; because this would significantly change the field, it warranted a new name.<ref name=":1" /> "Data science" became more widely used in the next few years: in 2002, the [[Committee on Data for Science and Technology]] launched ''Data Science Journal.'' In 2003, Columbia University launched ''The Journal of Data Science''.<ref name=":1" /> In 2014, the [[American Statistical Association]]'s Section on Statistical Learning and Data Mining changed its name to the Section on Statistical Learning and Data Science, reflecting the ascendant popularity of data science.<ref>{{Cite news|last=Talley|first=Jill|url=https://magazine.amstat.org/blog/2016/06/01/datascience-2/|title=ASA Expands Scope, Outreach to Foster Growth, Collaboration in Data Science|date=June 1, 2016|work=Amstat News|access-date=|url-status=live|publisher=American Statistical Association}}</ref> | | The modern conception of data science as an independent discipline is sometimes attributed to [[William S. Cleveland]].<ref>{{Cite web|url=https://www.stat.purdue.edu/~wsc/|title=William S Cleveland|last=Gupta|first=Shanti|date=December 11, 2015|website=|url-status=live|archive-url=|archive-date=|access-date=April 2, 2020}}</ref> In a 2001 paper, he advocated an expansion of statistics beyond theory into technical areas; because this would significantly change the field, it warranted a new name.<ref name=":1" /> "Data science" became more widely used in the next few years: in 2002, the [[Committee on Data for Science and Technology]] launched ''Data Science Journal.'' In 2003, Columbia University launched ''The Journal of Data Science''.<ref name=":1" /> In 2014, the [[American Statistical Association]]'s Section on Statistical Learning and Data Mining changed its name to the Section on Statistical Learning and Data Science, reflecting the ascendant popularity of data science.<ref>{{Cite news|last=Talley|first=Jill|url=https://magazine.amstat.org/blog/2016/06/01/datascience-2/|title=ASA Expands Scope, Outreach to Foster Growth, Collaboration in Data Science|date=June 1, 2016|work=Amstat News|access-date=|url-status=live|publisher=American Statistical Association}}</ref> |
| | | |
− | | + | The modern conception of data science as an independent discipline is attributed in part to William S. Cleveland.<ref> |
− | | |
− | | |
− | The professional title of “data scientist” has been attributed to [[DJ Patil]] and [[Jeff Hammerbacher]] in 2008.<ref>{{Cite news|last=Davenport|first=Thomas H.|url=https://hbr.org/2012/10/data-scientist-the-sexiest-job-of-the-21st-century|title=Data Scientist: The Sexiest Job of the 21st Century|date=2012-10-01|work=Harvard Business Review|access-date=2020-04-03|last2=Patil|first2=D. J.|issue=October 2012|issn=0017-8012}}</ref> Though it was used by the [[National Science Board]] in their 2005 report, "Long-Lived Digital Data Collections: Enabling Research and Education in the 21st Century," it referred broadly to any key role in managing a digital data collection.<ref>{{Cite web|url=https://www.nsf.gov/pubs/2005/nsb0540/|title=US NSF - NSB-05-40, Long-Lived Digital Data Collections Enabling Research and Education in the 21st Century|website=www.nsf.gov|access-date=2020-04-03}}</ref>
| |
− | | |
− | | |
− | | |
− | There is still no consensus on the definition of data science and it is considered by some to be a buzzword.<ref>{{Cite web|url=https://www.forbes.com/sites/gilpress/2013/08/19/data-science-whats-the-half-life-of-a-buzzword/|title=Data Science: What's The Half-Life Of A Buzzword?|last=Press|first=Gil|website=Forbes|language=en|access-date=2020-04-03}}</ref>
| |
− | | |
− | | |
− | | |
− | | |
− | | |
− | | |
− | | |
− | The modern conception of data science as an independent discipline is attributed in part to William S. Cleveland.<ref> | |
| {{Cite web|url=https://www.stat.purdue.edu/~wsc/|title=William S Cleveland|last=Gupta|first=Shanti|date=December 11, 2015|website=|url-status=live|archive-url=|archive-date=|access-date=April 2, 2020}} | | {{Cite web|url=https://www.stat.purdue.edu/~wsc/|title=William S Cleveland|last=Gupta|first=Shanti|date=December 11, 2015|website=|url-status=live|archive-url=|archive-date=|access-date=April 2, 2020}} |
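− | </ref> | + | </ref> In a 2001 paper, he advocated expanding statistics beyond theory into technical areas; because this would significantly change the field, it warranted a new name. "Data science" came into wider use over the next few years: in 2002, the Committee on Data for Science and Technology (CODATA) |
− | In 2001, William S. Cleveland introduced data science as an independent discipline in his article "Data Science: An Action Plan for Expanding the Technical Areas of the Field of Statistics", extending the field of statistics to incorporate "advances in computing with data". The article was published in the April 2001 edition of the ''International Statistical Review / Revue Internationale de Statistique'', volume 69, issue 1.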
| |
− | <ref name="cleveland01">
| |
− | Cleveland, W. S. (2001).
| |
− | [https://pdfs.semanticscholar.org/915c/d8e2b39eb02723553913d592b2237d4d9960.pdf Data science: an action plan for expanding the technical areas of the field of statistics].
| |
− | International Statistical Review / Revue Internationale de Statistique, 21–26.
| |
− | </ref>
| |
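− | In his report, he advocated expanding statistics beyond theory into technical areas; because this would significantly change the field, it warranted a new name. Cleveland identified six technical areas that he believed data science encompasses: multidisciplinary investigations, models and methods for data, computing with data, pedagogy, tool evaluation, and theory.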
| |
− | | |
− | | |
− | In April 2002, the International Council for Science (ICSU): Committee on Data for Science and Technology (CODATA)
| |
| <ref name="ics12"> | | <ref name="ics12"> |
| International Council for Science : Committee on Data for Science and Technology. (2012, April). | | International Council for Science : Committee on Data for Science and Technology. (2012, April). |
| CODATA, The Committee on Data for Science and Technology. Retrieved from International Council for Science : Committee on Data for Science and Technology: http://www.codata.org/ | | CODATA, The Committee on Data for Science and Technology. Retrieved from International Council for Science : Committee on Data for Science and Technology: http://www.codata.org/ |
| </ref> | | </ref> |
− | launched the ''Data Science Journal'' | + | launched the ''Data Science Journal''. |
| <ref name="dsj12"> | | <ref name="dsj12"> |
| Data Science Journal. (2012, April). | | Data Science Journal. (2012, April). |
| Available Volumes. | | Available Volumes. |
| Retrieved from Japan Science and Technology Information Aggregator, Electronic: http://www.jstage.jst.go.jp/browse/dsj/_vols | | Retrieved from Japan Science and Technology Information Aggregator, Electronic: http://www.jstage.jst.go.jp/browse/dsj/_vols |
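− | </ref> | + | </ref>In 2014, the American Statistical Association's Section on Statistical Learning and Data Mining changed its name to the Section on Statistical Learning and Data Science, reflecting the growing popularity of data science. |
− | , a publication focused on issues such as the description of data systems, their publication on the internet, applications, and legal issues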
| |
− | <ref name="dsj02">
| |
− | Data Science Journal. (2002, April).
| |
− | Contents of Volume 1, Issue 1, April 2002.
| |
− | Retrieved from Japan Science and Technology Information Aggregator,
| |
− | Electronic: http://www.jstage.jst.go.jp/browse/dsj/1/0/_contents
| |
− | </ref>
| |
− | . Shortly thereafter, in January 2003, Columbia University began publishing ''The Journal of Data Science''
| |
− | <ref name="jds03">
| |
− | The Journal of Data Science. (2003, January).
| |
− | Contents of Volume 1, Issue 1, January 2003.
| |
− | Retrieved from http://www.jds-online.com/v1-1
| |
− | </ref>
| |
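− | , which provided a platform for all data workers to present their views and exchange ideas. The journal was largely devoted to the application of statistical methods and quantitative research. In 2005, the National Science Board published "Long-Lived Digital Data Collections: Enabling Research and Education in the 21st Century", defining data scientists as "the information and computer scientists, database and software programmers, disciplinary experts, curators and expert annotators, librarians, archivists, and others who are crucial to the successful management of a digital data collection", whose primary activity is to "conduct creative inquiry and analysis."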
| |
− | <ref>
| |
− | National Science Board.
| |
− | [http://www.nsf.gov/pubs/2005/nsb0540/ Long-Lived Digital Data Collections Enabling Research and Education in the 21st Century]
| |
− | . National Science Foundation
| |
− | . Retrieved 30 June 2013.
| |
− | </ref>
| |
| | | |
| | | |
− | Around 2007,
| + | The professional title of “data scientist” has been attributed to [[DJ Patil]] and [[Jeff Hammerbacher]] in 2008.<ref>{{Cite news|last=Davenport|first=Thomas H.|url=https://hbr.org/2012/10/data-scientist-the-sexiest-job-of-the-21st-century|title=Data Scientist: The Sexiest Job of the 21st Century|date=2012-10-01|work=Harvard Business Review|access-date=2020-04-03|last2=Patil|first2=D. J.|issue=October 2012|issn=0017-8012}}</ref> Though it was used by the [[National Science Board]] in their 2005 report, "Long-Lived Digital Data Collections: Enabling Research and Education in the 21st Century," it referred broadly to any key role in managing a digital data collection.<ref>{{Cite web|url=https://www.nsf.gov/pubs/2005/nsb0540/|title=US NSF - NSB-05-40, Long-Lived Digital Data Collections Enabling Research and Education in the 21st Century|website=www.nsf.gov|access-date=2020-04-03}}</ref> |
− | <ref> | |
− | Citation needed
| |
− | </ref>
| |
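− | Turing Award winner [https://en.wikipedia.org/wiki/Jim_Gray_(computer_scientist) Jim Gray] envisioned that "data-driven science", which uses computational analysis of large data sets as its primary scientific method, would become the fourth paradigm of science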
| |
− | <ref name="TansleyTolle2009" /> | |
− | <ref name="BellHey2009" />
| |
− | , leading to a world in which all scientific literature and scientific data are linked and interoperate with each other.
| |
− | <ref> | |
− | Markoff, John (2009-12-14).
| |
− | [https://www.nytimes.com/2009/12/15/science/15books.html "Essays Inspired by Microsoft’s Jim Gray, Who Saw Science Paradigm Shift"]. ''The New York Times''. Retrieved 2018-04-26.
| |
− | </ref> | |
| | | |
| + | In 2008, DJ Patil and Jeff Hammerbacher coined the professional title "data scientist".<ref> |
| + | {{Cite news|last=Davenport|first=Thomas H.|url=https://hbr.org/2012/10/data-scientist-the-sexiest-job-of-the-21st-century|title=Data Scientist: The Sexiest Job of the 21st Century|date=2012-10-01|work=Harvard Business Review|access-date=2020-04-03|last2=Patil|first2=D. J.|issue=October 2012|issn=0017-8012}} |
| + | </ref>Although the National Science Board used the term in its 2005 report "Long-Lived Digital Data Collections: Enabling Research and Education in the 21st Century", it referred broadly to anyone playing a key role in managing a digital data collection.<ref>{{Cite web|url=https://www.nsf.gov/pubs/2005/nsb0540/|title=US NSF - NSB-05-40, Long-Lived Digital Data Collections Enabling Research and Education in the 21st Century|website=www.nsf.gov|access-date=2020-04-03}}</ref> |
| | | |
− | In 2008, DJ Patil and Jeff Hammerbacher were awarded the professional title of "data scientist".<ref>
| |
− | {{Cite news|last=Davenport|first=Thomas H.|url=https://hbr.org/2012/10/data-scientist-the-sexiest-job-of-the-21st-century|title=Data Scientist: The Sexiest Job of the 21st Century|date=2012-10-01|work=Harvard Business Review|access-date=2020-04-03|last2=Patil|first2=D. J.|issue=October 2012|issn=0017-8012}}
| |
− | </ref>Although the National Science Board used the term in its 2005 report "Long-Lived Digital Data Collections: Enabling Research and Education in the 21st Century", it referred broadly to any key role in managing a digital data collection.<ref>{{Cite web|url=https://www.nsf.gov/pubs/2005/nsb0540/|title=US NSF - NSB-05-40, Long-Lived Digital Data Collections Enabling Research and Education in the 21st Century|website=www.nsf.gov|access-date=2020-04-03}}</ref>
| |
| | | |
| + | There is still no consensus on the definition of data science and it is considered by some to be a buzzword.<ref>{{Cite web|url=https://www.forbes.com/sites/gilpress/2013/08/19/data-science-whats-the-half-life-of-a-buzzword/|title=Data Science: What's The Half-Life Of A Buzzword?|last=Press|first=Gil|website=Forbes|language=en|access-date=2020-04-03}}</ref> |
| | | |
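− | In the 2012 [https://en.wikipedia.org/wiki/Harvard_Business_Review ''Harvard Business Review''] article "Data Scientist: The Sexiest Job of the 21st Century"
| + | There is still no consensus on the definition of data science, and some consider it a buzzword. |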
− | <ref name="Harvard">
| |
− | {{Cite journal
| |
− | |url=https://hbr.org/2012/10/data-scientist-the-sexiest-job-of-the-21st-century/
| |
− | |title=Data Scientist: The Sexiest Job of the 21st Century
| |
− | |publisher=Harvard Business Review
| |
− | |first=Thomas H.
| |
− | |last=Davenport
| |
− | |first2=DJ
| |
− | |last2=Patil
| |
− | |date=Oct 2012
| |
− | }}</ref>
| |
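− | , [https://en.wikipedia.org/wiki/DJ_Patil DJ Patil] claims to have coined this term in 2008 together with [https://en.wikipedia.org/wiki/Jeff_Hammerbacher Jeff Hammerbacher] to describe their jobs at LinkedIn and Facebook. He asserts that the data scientist is an entirely new type of profession and that the shortage of data scientists is becoming a serious constraint in some sectors, while also describing it as a more business-oriented role.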
| |
| | | |
| | | |
− | In 2013, the IEEE Task Force on Data Science and Advanced Analytics
| + | Patil] (DJ Patil) claims to have coined this term in 2008 together with [https://en.wikipedia.org/wiki/Jeff_Hammerbacher Jeff Hammerbacher] |
− | <ref>
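| + | The Section on Statistical Learning and Data Mining renamed its journal "Statistical Analysis and Data Mining: The ASA Data Science Journal" and, in 2016, changed its section name to "Statistical Learning and Data Science". |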
− | [http://www.dsaa.co "IEEE Task Force on Data Science and Advanced Analytics"]
| |
− | </ref>
| |
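− | was launched. In the same year, the first "European Conference on Data Analysis (ECDA)" was held in Luxembourg, where the [http://euads.org/ European Association for Data Science] (EuADS) was established. The first international conference, the IEEE International Conference on Data Science and Advanced Analytics, was held in 2014.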
| |
− | <ref>
| |
− | [http://datamining.it.uts.edu.au/conferences/dsaa14/ "2014 IEEE International Conference on Data Science and Advanced Analytics"]
| |
− | </ref>
| |
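− | In the same year, [https://en.wikipedia.org/wiki/General_Assembly_(school) General Assembly], a pioneer of coding bootcamps, launched a student-paid bootcamp, and [https://en.wikipedia.org/wiki/The_Data_Incubator The Data Incubator] launched a competitive, free data science fellowship.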
| |
− | <ref>
| |
− | [https://venturebeat.com/2014/04/15/ny-gets-new-bootcamp-for-data-scientists-its-free-but-harder-to-get-into-than-harvard/ "NY gets new bootcamp for data scientists: It’s free, but harder to get into than Harvard "].
| |
− | ''Venture Beat'' Retrieved 2016-02-22.
| |
− | </ref>
| |
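− | Also in 2014, the [https://en.wikipedia.org/wiki/American_Statistical_Association American Statistical Association]'s Section on Statistical Learning and Data Mining renamed its journal "Statistical Analysis and Data Mining: The ASA Data Science Journal" and, in 2016, changed its section name to "Statistical Learning and Data Science".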
| |
| <ref name="ASA"> | | <ref name="ASA"> |
| Talley, Jill (2016-06-01) | | Talley, Jill (2016-06-01) |
Line 329: |
Line 250: |
| | | |
| | | |
− | In 2015, Springer launched the International Journal on Data Science and Analytics
| |
− | <ref>
| |
− | [https://www.springer.com/41060 "Journal on Data Science and Analytics"]
| |
− | </ref>
| |
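− | , to publish original work on data science and big data analytics. In September 2015, the [http://www.gfkl.org/welcome/ GfKl] added "Data Science Society" to its name at the third ECDA conference, held at the [https://en.wikipedia.org/wiki/University_of_Essex University of Essex] in Colchester, UK.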
| |
| | | |
| | | |