hub / github.com/scikit-learn/scikit-learn / scale

Function scale

sklearn/preprocessing/_data.py:146–302 · view source on GitHub ↗

Standardize a dataset along any axis. Center to the mean and component wise scale to unit variance. Read more in the :ref:`User Guide `. Parameters ---------- X : {array-like, sparse matrix} of shape (n_samples, n_features) The data to center and

(X, *, axis=0, with_mean=True, with_std=True, copy=True)

Source from the content-addressed store, hash-verified

144	prefer_skip_nested_validation=True,
145	)
146	def scale(X, *, axis=0, with_mean=True, with_std=True, copy=True):
147	"""Standardize a dataset along any axis.
148
149	Center to the mean and component wise scale to unit variance.
150
151	Read more in the :ref:`User Guide <preprocessing_scaler>`.
152
153	Parameters
154	----------
155	X : {array-like, sparse matrix} of shape (n_samples, n_features)
156	The data to center and scale.
157
158	axis : {0, 1}, default=0
159	Axis used to compute the means and standard deviations along. If 0,
160	independently standardize each feature, otherwise (if 1) standardize
161	each sample.
162
163	with_mean : bool, default=True
164	If True, center the data before scaling.
165
166	with_std : bool, default=True
167	If True, scale the data to unit variance (or equivalently,
168	unit standard deviation).
169
170	copy : bool, default=True
171	If False, try to avoid a copy and scale in place.
172	This is not guaranteed to always work in place; e.g. if the data is
173	a numpy array with an int dtype, a copy will be returned even with
174	copy=False.
175
176	Returns
177	-------
178	X_tr : {ndarray, sparse matrix} of shape (n_samples, n_features)
179	The transformed data.
180
181	See Also
182	--------
183	StandardScaler : Performs scaling to unit variance using the Transformer
184	API (e.g. as part of a preprocessing
185	:class:`~sklearn.pipeline.Pipeline`).
186
187	Notes
188	-----
189	This implementation will refuse to center scipy.sparse matrices
190	since it would make them non-sparse and would potentially crash the
191	program with memory exhaustion problems.
192
193	Instead the caller is expected to either set explicitly
194	`with_mean=False` (in that case, only variance scaling will be
195	performed on the features of the CSC matrix) or to call `X.toarray()`
196	if he/she expects the materialized dense array to fit in memory.
197
198	To avoid memory copy the caller should pass a CSC matrix.
199
200	NaNs are treated as missing values: disregarded to compute the statistics,
201	and maintained during the data transformation.
202
203	We use a biased estimator for the standard deviation, equivalent to

Callers 15

test_cross_validate_return_indicesFunction · 0.90

check_classifiers_multilabel_representation_invarianceFunction · 0.90

check_classifiers_multilabel_output_format_predictFunction · 0.90

check_classifiers_multilabel_output_format_predict_probaFunction · 0.90

check_classifiers_multilabel_output_format_decision_functionFunction · 0.90

check_regressors_trainFunction · 0.90

test_response.pyFile · 0.90

test_mlp.pyFile · 0.90

_estimate_miFunction · 0.90

test_standard_scaler_constant_featuresFunction · 0.90

test_scale_1dFunction · 0.90

test_standard_scaler_numerical_stabilityFunction · 0.90

Calls 4

check_arrayFunction · 0.90

mean_variance_axisFunction · 0.90

inplace_column_scaleFunction · 0.90

_handle_zeros_in_scaleFunction · 0.85

Tested by 15

test_cross_validate_return_indicesFunction · 0.72

test_standard_scaler_constant_featuresFunction · 0.72

test_scale_1dFunction · 0.72

test_standard_scaler_numerical_stabilityFunction · 0.72

test_scaler_2d_arraysFunction · 0.72

test_scale_sparse_with_mean_raise_exceptionFunction · 0.72

test_scale_input_finiteness_validationFunction · 0.72

test_scale_function_without_centeringFunction · 0.72

test_power_transformer_1dFunction · 0.72

test_power_transformer_2dFunction · 0.72

test_balanced_weightFunction · 0.72

test_sparsifyFunction · 0.72

Used in the wild real call sites across dependent graphs

searching dependent graphs…