Compute the multi-dimensional z-normalized matrix profile with a ``dask``/``ray`` cluster This is a highly distributed implementation around the Numba JIT-compiled parallelized ``_mstump`` function which computes the multi-dimensional matrix profile according to STOMP. Note tha
(
client,
T,
m,
include=None,
discords=False,
p=2.0,
normalize=True,
T_subseq_isconstant=None,
)
| 362 | exclude=["normalize", "T_subseq_isconstant"], |
| 363 | ) |
| 364 | def mstumped( |
| 365 | client, |
| 366 | T, |
| 367 | m, |
| 368 | include=None, |
| 369 | discords=False, |
| 370 | p=2.0, |
| 371 | normalize=True, |
| 372 | T_subseq_isconstant=None, |
| 373 | ): |
| 374 | """ |
| 375 | Compute the multi-dimensional z-normalized matrix profile with a |
| 376 | ``dask``/``ray`` cluster |
| 377 | |
| 378 | This is a highly distributed implementation around the Numba JIT-compiled |
| 379 | parallelized ``_mstump`` function which computes the multi-dimensional matrix |
| 380 | profile according to STOMP. Note that only self-joins are supported. |
| 381 | |
| 382 | Parameters |
| 383 | ---------- |
| 384 | client : client |
| 385 | A ``dask``/``ray`` client. Setting up a cluster is beyond the scope of this |
| 386 | library. Please refer to the ``dask``/``ray`` documentation. |
| 387 | |
| 388 | T : numpy.ndarray |
| 389 | The time series or sequence for which to compute the multi-dimensional |
| 390 | matrix profile. Each row in ``T`` represents data from the same |
| 391 | dimension while each column in ``T`` represents data from a different |
| 392 | dimension. |
| 393 | |
| 394 | m : int |
| 395 | Window size. |
| 396 | |
| 397 | include : list, numpy.ndarray, default None |
| 398 | A list of (zero-based) indices corresponding to the dimensions in ``T`` that |
| 399 | must be included in the constrained multidimensional motif search. |
| 400 | For more information, see Section IV D in: |
| 401 | |
| 402 | `DOI: 10.1109/ICDM.2017.66 \ |
| 403 | <https://www.cs.ucr.edu/~eamonn/Motif_Discovery_ICDM.pdf>`__ |
| 404 | |
| 405 | discords : bool, default False |
| 406 | When set to ``True``, this reverses the distance matrix which results in a |
| 407 | multi-dimensional matrix profile that favors larger matrix profile values |
| 408 | (i.e., discords) rather than smaller values (i.e., motifs). Note that indices |
| 409 | in `include` are still maintained and respected. |
| 410 | |
| 411 | p : float, default 2.0 |
| 412 | The p-norm to apply for computing the Minkowski distance. Minkowski distance is |
| 413 | typically used with ``p`` being ``1`` or ``2``, which correspond to the |
| 414 | Manhattan distance and the Euclidean distance, respectively. |
| 415 | |
| 416 | normalize : bool, default True |
| 417 | When set to ``True``, this z-normalizes subsequences prior to computing |
| 418 | distances. Otherwise, this function gets re-routed to its complementary |
| 419 | non-normalized equivalent set in the ``@core.non_normalized`` function |
| 420 | decorator. |
| 421 |