MCPcopy
hub / github.com/stumpy-dev/stumpy / mstumped

Function mstumped

stumpy/mstumped.py:364–537  ·  view source on GitHub ↗

Compute the multi-dimensional z-normalized matrix profile with a ``dask``/``ray`` cluster This is a highly distributed implementation around the Numba JIT-compiled parallelized ``_mstump`` function which computes the multi-dimensional matrix profile according to STOMP. Note tha

(
    client,
    T,
    m,
    include=None,
    discords=False,
    p=2.0,
    normalize=True,
    T_subseq_isconstant=None,
)

Source from the content-addressed store, hash-verified

362 exclude=["normalize", "T_subseq_isconstant"],
363)
364def mstumped(
365 client,
366 T,
367 m,
368 include=None,
369 discords=False,
370 p=2.0,
371 normalize=True,
372 T_subseq_isconstant=None,
373):
374 """
375 Compute the multi-dimensional z-normalized matrix profile with a
376 ``dask``/``ray`` cluster
377
378 This is a highly distributed implementation around the Numba JIT-compiled
379 parallelized ``_mstump`` function which computes the multi-dimensional matrix
380 profile according to STOMP. Note that only self-joins are supported.
381
382 Parameters
383 ----------
384 client : client
385 A ``dask``/``ray`` client. Setting up a cluster is beyond the scope of this
386 library. Please refer to the ``dask``/``ray`` documentation.
387
388 T : numpy.ndarray
389 The time series or sequence for which to compute the multi-dimensional
390 matrix profile. Each row in ``T`` represents data from the same
391 dimension while each column in ``T`` represents data from a different
392 dimension.
393
394 m : int
395 Window size.
396
397 include : list, numpy.ndarray, default None
398 A list of (zero-based) indices corresponding to the dimensions in ``T`` that
399 must be included in the constrained multidimensional motif search.
400 For more information, see Section IV D in:
401
402 `DOI: 10.1109/ICDM.2017.66 \
403 <https://www.cs.ucr.edu/~eamonn/Motif_Discovery_ICDM.pdf>`__
404
405 discords : bool, default False
406 When set to ``True``, this reverses the distance matrix which results in a
407 multi-dimensional matrix profile that favors larger matrix profile values
408 (i.e., discords) rather than smaller values (i.e., motifs). Note that indices
409 in `include` are still maintained and respected.
410
411 p : float, default 2.0
412 The p-norm to apply for computing the Minkowski distance. Minkowski distance is
413 typically used with ``p`` being ``1`` or ``2``, which correspond to the
414 Manhattan distance and the Euclidean distance, respectively.
415
416 normalize : bool, default True
417 When set to ``True``, this z-normalizes subsequences prior to computing
418 distances. Otherwise, this function gets re-routed to its complementary
419 non-normalized equivalent set in the ``@core.non_normalized`` function
420 decorator.
421

Calls 1

mparrayClass · 0.90