ML之kNNC：基于iris莺尾花資料集(PCA處理+三維散點圖可視化)利用kNN算法實作分類預測

設計思路

輸出結果

(149, 5)

5.1 3.5 1.4 0.2 Iris-setosa

0 4.9 3.0 1.4 0.2 Iris-setosa

1 4.7 3.2 1.3 0.2 Iris-setosa

2 4.6 3.1 1.5 0.2 Iris-setosa

3 5.0 3.6 1.4 0.2 Iris-setosa

4 5.4 3.9 1.7 0.4 Iris-setosa

Sepal_Length Sepal_Width Petal_Length Petal_Width type

0 4.5 2.3 1.3 0.3 Iris-setosa

1 6.3 2.5 5.0 1.9 Iris-virginica

2 5.1 3.4 1.5 0.2 Iris-setosa

3 6.3 3.3 6.0 2.5 Iris-virginica

4 6.8 3.2 5.9 2.3 Iris-virginica

切分點： 29

label_classes: ['Iris-setosa', 'Iris-versicolor', 'Iris-virginica']

kNNDIY模型預測，基于原資料： 0.95

kNN模型預測，基于原資料預測： [0.96666667 1. 0.93333333 1. 0.93103448]

kNN模型預測，原資料PCA處理後： [1. 0.96 0.95918367]

核心代碼

class KNeighborsClassifier Found at: sklearn.neighbors._classification

class KNeighborsClassifier(NeighborsBase, KNeighborsMixin,

SupervisedIntegerMixin, ClassifierMixin):

"""Classifier implementing the k-nearest neighbors vote.

Read more in the :ref:`User Guide <classification>`.

Parameters

----------

n_neighbors : int, default=5

Number of neighbors to use by default for :meth:`kneighbors` queries.

weights : {'uniform', 'distance'} or callable, default='uniform'

weight function used in prediction. Possible values:

- 'uniform' : uniform weights. All points in each neighborhood

are weighted equally.

- 'distance' : weight points by the inverse of their distance.

in this case, closer neighbors of a query point will have a

greater influence than neighbors which are further away.

- [callable] : a user-defined function which accepts an

array of distances, and returns an array of the same shape

containing the weights.

algorithm : {'auto', 'ball_tree', 'kd_tree', 'brute'}, default='auto'

Algorithm used to compute the nearest neighbors:

- 'ball_tree' will use :class:`BallTree`

- 'kd_tree' will use :class:`KDTree`

- 'brute' will use a brute-force search.

- 'auto' will attempt to decide the most appropriate algorithm

based on the values passed to :meth:`fit` method.

Note: fitting on sparse input will override the setting of

this parameter, using brute force.

leaf_size : int, default=30

Leaf size passed to BallTree or KDTree. This can affect the

speed of the construction and query, as well as the memory

required to store the tree. The optimal value depends on the

nature of the problem.

p : int, default=2

Power parameter for the Minkowski metric. When p = 1, this is

equivalent to using manhattan_distance (l1), and euclidean_distance

(l2) for p = 2. For arbitrary p, minkowski_distance (l_p) is used.

metric : str or callable, default='minkowski'

the distance metric to use for the tree. The default metric is

minkowski, and with p=2 is equivalent to the standard Euclidean

metric. See the documentation of :class:`DistanceMetric` for a

list of available metrics.

If metric is "precomputed", X is assumed to be a distance matrix and

must be square during fit. X may be a :term:`sparse graph`,

in which case only "nonzero" elements may be considered neighbors.

metric_params : dict, default=None

Additional keyword arguments for the metric function.

n_jobs : int, default=None

The number of parallel jobs to run for neighbors search.

``None`` means 1 unless in a :obj:`joblib.parallel_backend` context.

``-1`` means using all processors. See :term:`Glossary <n_jobs>`

for more details.

Doesn't affect :meth:`fit` method.

Attributes

classes_ : array of shape (n_classes,)

Class labels known to the classifier

effective_metric_ : str or callble

The distance metric used. It will be same as the `metric` parameter

or a synonym of it, e.g. 'euclidean' if the `metric` parameter set to

'minkowski' and `p` parameter set to 2.

effective_metric_params_ : dict

Additional keyword arguments for the metric function. For most

metrics

will be same with `metric_params` parameter, but may also contain the

`p` parameter value if the `effective_metric_` attribute is set to

'minkowski'.

outputs_2d_ : bool

False when `y`'s shape is (n_samples, ) or (n_samples, 1) during fit

otherwise True.

Examples

--------

>>> X = [[0], [1], [2], [3]]

>>> y = [0, 0, 1, 1]

>>> from sklearn.neighbors import KNeighborsClassifier

>>> neigh = KNeighborsClassifier(n_neighbors=3)

>>> neigh.fit(X, y)

KNeighborsClassifier(...)

>>> print(neigh.predict([[1.1]]))

[0]

>>> print(neigh.predict_proba([[0.9]]))

[[0.66666667 0.33333333]]

ML之kNNC：基于iris莺尾花資料集(PCA處理+三維散點圖可視化)利用kNN算法實作分類預測

設計思路

輸出結果

核心代碼

繼續閱讀

Codeforces 1417 D. Make Them Equal(思維+構造)

查找算法之二分查找查找算法之二分查找

查找算法學習之二分查找（Python版本）——BinarySearch

CQ V1.0分詞bates(基于雙數組tire樹)—應該是目前最快的中文分詞算法

Command Network(POJ 3164)---定根最小樹形圖模闆題題目描述輸入格式輸出格式輸入樣例輸出樣例分析源程式

開源低帶寬語音編解碼器

241 Different Ways to Add Parentheses（C代碼版）

【趨高機器視覺】機器視覺技術原了解析及解決方案

CSMA/CD1． CSMA/CD的概述2． CSMA 的工作原理3． CSMA/CD控制規程及特點4． CSMA/CD協定5． CSMA/CD的優點6．結束語

極大似然法(ML)與最大期望法(EM)

C++ 第十五周報告1--《冒泡法排序》

筆試面試題目：滑動視窗(二)

資料結構與算法（27）——排序（二）

Dijkstra--簡易版（最短路徑）

GitHub連夜封殺！這份阿裡 10W 字内部 Java 字面試手冊到底有多強？

hdu7108哈希