[100] IRIS(鸢尾花)的数据集
数据编号: 100
# 数据描述
IRIS(鸢尾花)数据是数据科学领域最著名的数据集。该数据集被广泛用于数据分析、挖掘等入门教学,Fisher的论文是该领域的经典之作,至今仍被频繁引用。该数据集包含3种花的类别,每个类别50个样本。一类与另两类是线性可分的,后两类不是线性可分离的。
## 数据行数, 列数
150, 5
## 目标变量
鸢尾属植物类。
class: 类别
–Iris-setosa: 刚毛鸢尾
–Iris-versicolor: 花色鸢尾
–Iris-virginica: 弗吉尼亚鸢尾
## 特征变量
1.sepal length (cm): 萼片长度(厘米)
2.sepal width (cm): 萼片宽度(厘米)
3.petal length (cm): 花瓣长度(厘米)
4.petal width (cm): 花瓣宽度(厘米)
参考文献:
[1]Fisher,R.A. “The use of multiple measurements in taxonomic problems” Annual Eugenics, 7, Part II, 179-188 (1936); also in “Contributions to Mathematical Statistics” (John Wiley, NY, 1950).
[2]Duda,R.O., & Hart,P.E. (1973) Pattern Classification and Scene Analysis. (Q327.D83) John Wiley & Sons. ISBN 0-471-22361-1. See page 218.
[3]Dasarathy, B.V. (1980) “Nosing Around the Neighborhood: A New System Structure and Classification Rule for Recognition in Partially Exposed Environments”. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. PAMI-2, No. 1, 67-71.
[4]Gates, G.W. (1972) “The Reduced Nearest Neighbor Rule”. IEEE Transactions on Information Theory, May 1972, 431-433.
[5]王慧,冀晓亮.鸢尾花数据集剖析人工智能经典算法[J].科技与创新,2021(18):14-19+21.