Line 17: |
Line 17: |
| Automated machine learning (AutoML) is the process of automating the application of machine learning to real-world problems. AutoML covers the complete pipeline from the raw dataset to the deployable machine learning model. AutoML was proposed as an artificial intelligence-based solution to the ever-growing challenge of applying machine learning. The high degree of automation in AutoML allows non-experts to make use of machine learning models and techniques without first having to become experts in the field. | | Automated machine learning (AutoML) is the process of automating the application of machine learning to real-world problems. AutoML covers the complete pipeline from the raw dataset to the deployable machine learning model. AutoML was proposed as an artificial intelligence-based solution to the ever-growing challenge of applying machine learning. The high degree of automation in AutoML allows non-experts to make use of machine learning models and techniques without first having to become experts in the field. |
| | | |
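− | '''<font color="#ff8000">Automated machine learning (AutoML)</font>''' is a method that can automate the process of applying machine learning to real-world problems. Automated machine learning covers the entire pipeline from the raw dataset to a deployable machine learning model. As an artificial intelligence-based solution, automated machine learning was proposed to address the growing challenges of applying machine learning. The high degree of automation in automated machine learning lets non-experts use machine learning models and techniques without first having to become experts in the field. | + | '''<font color="#ff8000">Automated machine learning (AutoML)</font>''' is a method that can automate the process of applying machine learning to real-world problems. Automated machine learning covers the entire pipeline from the raw dataset to a deployable machine learning model. As an artificial intelligence-based solution, automated machine learning was proposed as a means of addressing the growing challenges of applying machine learning. The high degree of automation in automated machine learning lets non-experts use the models and techniques of machine learning without already having become experts in the field. |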
| | | |
| | | |
Line 25: |
Line 25: |
| Automating the process of applying machine learning end-to-end offers the additional advantages of simpler solutions, faster creation of those solutions, and models that often outperform hand-designed ones. | | Automating the process of applying machine learning end-to-end offers the additional advantages of simpler solutions, faster creation of those solutions, and models that often outperform hand-designed ones. |
| | | |
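− | In machine learning applications, automating the end-to-end process also offers further advantages: producing simpler solutions, creating those solutions faster, and, in general, outperforming hand-designed models.
| + | In machine learning applications, automating the end-to-end process can offer further advantages: producing simpler solutions, creating those solutions faster, and, in general, designing models that outperform hand-designed ones. |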
| | | |
| | | |
Line 36: |
Line 36: |
| In a typical machine learning application, practitioners have a dataset consisting of input data points to train on. The raw data itself may not be in a form such that all algorithms may be applicable to it out of the box. An expert may have to apply appropriate data pre-processing, feature engineering, feature extraction, and feature selection methods that make the dataset amenable to machine learning. Following those preprocessing steps, practitioners must then perform algorithm selection and hyperparameter optimization to maximize the predictive performance of their machine learning model. Clearly, all of those steps pose their own challenges, which add up to a significant hurdle to getting started with machine learning. | | In a typical machine learning application, practitioners have a dataset consisting of input data points to train on. The raw data itself may not be in a form such that all algorithms may be applicable to it out of the box. An expert may have to apply appropriate data pre-processing, feature engineering, feature extraction, and feature selection methods that make the dataset amenable to machine learning. Following those preprocessing steps, practitioners must then perform algorithm selection and hyperparameter optimization to maximize the predictive performance of their machine learning model. Clearly, all of those steps pose their own challenges, which add up to a significant hurdle to getting started with machine learning. |
| | | |
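− | In a typical machine learning application, users of the program have a dataset consisting of input data points to train on. The raw data itself may not be in a form suitable for all algorithms. An expert may need to apply appropriate methods such as data pre-processing, feature engineering, feature extraction, and feature selection to make the dataset suitable for machine learning. After these preprocessing steps, practitioners must perform algorithm selection and hyperparameter optimization to maximize the predictive performance of their machine learning model. Clearly, all of these steps bring their own challenges, which accumulate into a major obstacle to machine learning.
| + | In a typical machine learning application, users of the program have a dataset consisting of input data points to train on. The raw data itself may not be in a form suitable for all algorithms. An expert may need to apply appropriate methods such as data pre-processing, feature engineering, feature extraction, and feature selection to make the dataset suitable for machine learning. After these preprocessing steps, users of the program must perform the selection of an algorithm and hyperparameter optimization to maximize the predictive performance of their machine learning model. Clearly, all of these steps bring their own challenges, which accumulate into a major obstacle to machine learning. |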
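The algorithm selection and hyperparameter optimization step described in this paragraph can be illustrated with a minimal sketch. It uses only the Python standard library; the toy dataset, the two candidate algorithms, the search space, and every function name are assumptions made for illustration and do not come from the article or from any AutoML library. An exhaustive search over (algorithm, hyperparameter) pairs scored by cross-validation is only one simple strategy among many.

```python
import random
import statistics


def knn_predict(train, x, k):
    """Predict by averaging the targets of the k nearest training points."""
    nearest = sorted(train, key=lambda p: abs(p[0] - x))[:k]
    return statistics.fmean(y for _, y in nearest)


def mean_predict(train, x, _unused=None):
    """Baseline: always predict the mean training target."""
    return statistics.fmean(y for _, y in train)


def cv_error(predict, param, data, folds=4):
    """Mean squared error of `predict` under k-fold cross-validation."""
    errors = []
    for i in range(folds):
        test = data[i::folds]
        train = [p for j, p in enumerate(data) if j % folds != i]
        errors.extend((predict(train, x, param) - y) ** 2 for x, y in test)
    return statistics.fmean(errors)


def select_model(data, search_space):
    """Score every (algorithm, hyperparameter) pair and keep the best one."""
    candidates = [
        (cv_error(fn, param, data), name, param)
        for name, (fn, params) in search_space.items()
        for param in params
    ]
    return min(candidates)  # lowest cross-validated error wins


# Hypothetical toy dataset: y = x^2 plus a little noise.
rng = random.Random(0)
data = [(x / 10, (x / 10) ** 2 + rng.gauss(0, 0.05)) for x in range(-30, 31)]
rng.shuffle(data)

# Hypothetical search space: algorithm name -> (predict function, hyperparameters).
search_space = {
    "mean_baseline": (mean_predict, [None]),
    "knn": (knn_predict, [1, 3, 5, 9]),
}
best_error, best_name, best_param = select_model(data, search_space)
print(best_name, best_param, round(best_error, 4))
```

Real AutoML systems typically replace the exhaustive loop in `select_model` with a smarter search and also search over preprocessing steps, but the interface is similar: a search space goes in and the best-scoring configuration comes out.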
| | | |
| | | |
Line 65: |
Line 65: |
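| ==[[用户:Yuling|Yuling]]([[用户讨论:Yuling|talk]]) "categorical text feature" and "free text feature": these two appear to be technical terms, and no established translation was found | | ==[[用户:Yuling|Yuling]]([[用户讨论:Yuling|talk]]) "categorical text feature" and "free text feature": these two appear to be technical terms, and no established translation was found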
| ** Automated task detection; e.g., [[binary classification]], [[regression analysis|regression]], clustering, or [[learning to rank|ranking]] | | ** Automated task detection; e.g., [[binary classification]], [[regression analysis|regression]], clustering, or [[learning to rank|ranking]] |
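− | Automated task detection, e.g.: binary classification, regression analysis/regression, clustering, learning to rank
| + | Automated task detection, e.g.: binary classification, regression analysis, clustering, learning to rank |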
| * Automated [[feature engineering]] | | * Automated [[feature engineering]] |
| Automated feature engineering | | Automated feature engineering |
Line 79: |
Line 79: |
| Automated model selection | | Automated model selection |
| * [[Hyperparameter (machine learning)#Optimization|Hyperparameter optimization]] of the learning algorithm and featurization | | * [[Hyperparameter (machine learning)#Optimization|Hyperparameter optimization]] of the learning algorithm and featurization |
− | + | Hyperparameter optimization of the featurization and the learning algorithm |
| * Automated pipeline selection under time, memory, and complexity constraints | | * Automated pipeline selection under time, memory, and complexity constraints |
− | + | Automated pipeline selection under time, memory, and complexity constraints |
| * Automated selection of evaluation metrics / validation procedures | | * Automated selection of evaluation metrics / validation procedures |
− | + | Automated selection of evaluation metrics / validation procedures |
| * Automated problem checking | | * Automated problem checking |
− | + | Automated problem checking |
| ** Leakage detection | | ** Leakage detection |
− | + | Data leakage detection |
| ** Misconfiguration detection | | ** Misconfiguration detection |
− | + | Misconfiguration detection |
| * Automated analysis of results obtained | | * Automated analysis of results obtained |
− | + | Automated analysis of the results obtained |
| * User interfaces and visualizations for automated machine learning | | * User interfaces and visualizations for automated machine learning |
− | + | User interfaces and visualizations for automated machine learning |
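As one illustration of the targets listed above, automated task detection might be sketched as a heuristic over the target column. The function name `detect_task`, the `max_classes` threshold, and the rules themselves are hypothetical choices for this sketch, not the behaviour of any particular AutoML tool. Ranking is not detected here, because recognizing it would need information about query groups that a bare target column does not carry.

```python
def detect_task(targets, max_classes=20):
    """Guess the learning task from the target column (hypothetical heuristic)."""
    if targets is None:
        return "clustering"  # no labels at all: treat as unsupervised
    distinct = set(targets)
    if len(distinct) == 2:
        return "binary classification"
    # bool is a subclass of int, so exclude it explicitly
    numeric = all(
        isinstance(t, (int, float)) and not isinstance(t, bool) for t in distinct
    )
    # Many distinct numbers, or any non-integer number, suggests a continuous target
    if numeric and (
        len(distinct) > max_classes
        or any(not float(t).is_integer() for t in distinct)
    ):
        return "regression"
    return "multiclass classification"


print(detect_task(["spam", "ham", "spam"]))
print(detect_task([0.5, 1.7, 2.3, 0.5]))
print(detect_task(None))
```

A heuristic like this can misjudge edge cases, for example a regression target that happens to take only a few integer values, which is one reason the list also names misconfiguration detection as a separate target.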
| | | |
| | | |