hub / github.com/scikit-learn/scikit-learn / DecisionTreeClassifier

Class DecisionTreeClassifier

sklearn/tree/_classes.py:699–1097 · view source on GitHub ↗

A decision tree classifier. Read more in the :ref:`User Guide `. Parameters ---------- criterion : {"gini", "entropy", "log_loss"}, default="gini" The function to measure the quality of a split. Supported criteria are "gini" for the Gini impurity and "log_loss

Source from the content-addressed store, hash-verified

697
698
699	class DecisionTreeClassifier(ClassifierMixin, BaseDecisionTree):
700	"""A decision tree classifier.
701
702	Read more in the :ref:`User Guide <tree>`.
703
704	Parameters
705	----------
706	criterion : {"gini", "entropy", "log_loss"}, default="gini"
707	The function to measure the quality of a split. Supported criteria are
708	"gini" for the Gini impurity and "log_loss" and "entropy" both for the
709	Shannon information gain, see :ref:`tree_mathematical_formulation`.
710
711	splitter : {"best", "random"}, default="best"
712	The strategy used to choose the split at each node. Supported
713	strategies are "best" to choose the best split and "random" to choose
714	the best random split.
715
716	max_depth : int, default=None
717	The maximum depth of the tree. If None, then nodes are expanded until
718	all leaves are pure or until all leaves contain less than
719	min_samples_split samples.
720
721	min_samples_split : int or float, default=2
722	The minimum number of samples required to split an internal node:
723
724	- If int, then consider `min_samples_split` as the minimum number.
725	- If float, then `min_samples_split` is a fraction and
726	`ceil(min_samples_split * n_samples)` are the minimum
727	number of samples for each split.
728
729	.. versionchanged:: 0.18
730	Added float values for fractions.
731
732	min_samples_leaf : int or float, default=1
733	The minimum number of samples required to be at a leaf node.
734	A split point at any depth will only be considered if it leaves at
735	least ``min_samples_leaf`` training samples in each of the left and
736	right branches. This may have the effect of smoothing the model,
737	especially in regression.
738
739	- If int, then consider `min_samples_leaf` as the minimum number.
740	- If float, then `min_samples_leaf` is a fraction and
741	`ceil(min_samples_leaf * n_samples)` are the minimum
742	number of samples for each node.
743
744	.. versionchanged:: 0.18
745	Added float values for fractions.
746
747	min_weight_fraction_leaf : float, default=0.0
748	The minimum weighted fraction of the sum total of weights (of all
749	the input samples) required to be at a leaf node. Samples have
750	equal weight when sample_weight is not provided.
751
752	max_features : int, float or {"sqrt", "log2"}, default=None
753	The number of features to consider when looking for the best split:
754
755	- If int, then consider `max_features` features at each split.
756	- If float, then `max_features` is a fraction and

Callers 15

test_curve_display_parameters_validationFunction · 0.90

test_learning_curve_display_default_usageFunction · 0.90

test_validation_curve_display_default_usageFunction · 0.90

test_curve_display_negate_scoreFunction · 0.90

test_curve_display_score_nameFunction · 0.90

test_learning_curve_display_score_typeFunction · 0.90

test_validation_curve_display_score_typeFunction · 0.90

test_curve_display_xscale_autoFunction · 0.90

test_curve_display_std_display_styleFunction · 0.90

test_curve_display_plot_kwargsFunction · 0.90

test_validation_curve_xscale_from_param_range_provided_as_a_listFunction · 0.90

test_subclassing_displaysFunction · 0.90

Calls 2

StrOptionsClass · 0.90

HiddenClass · 0.90

Tested by 15

test_curve_display_parameters_validationFunction · 0.72

test_learning_curve_display_default_usageFunction · 0.72

test_validation_curve_display_default_usageFunction · 0.72

test_curve_display_negate_scoreFunction · 0.72

test_curve_display_score_nameFunction · 0.72

test_learning_curve_display_score_typeFunction · 0.72

test_validation_curve_display_score_typeFunction · 0.72

test_curve_display_xscale_autoFunction · 0.72

test_curve_display_std_display_styleFunction · 0.72

test_curve_display_plot_kwargsFunction · 0.72

test_validation_curve_xscale_from_param_range_provided_as_a_listFunction · 0.72

test_subclassing_displaysFunction · 0.72

Used in the wild real call sites across dependent graphs

searching dependent graphs…