Compute the multi-dimensional non-normalized (i.e., without z-normalization) matrix profile with a `ray` cluster This is a highly distributed implementation around the Numba JIT-compiled parallelized `_maamp` function which computes the multi-dimensional matrix profile accordin
(
ray_client,
T_A,
T_B,
m,
excl_zone,
T_A_subseq_isfinite,
T_B_subseq_isfinite,
p,
include,
discords,
)
| 167 | |
| 168 | |
| 169 | def _ray_maamped( |
| 170 | ray_client, |
| 171 | T_A, |
| 172 | T_B, |
| 173 | m, |
| 174 | excl_zone, |
| 175 | T_A_subseq_isfinite, |
| 176 | T_B_subseq_isfinite, |
| 177 | p, |
| 178 | include, |
| 179 | discords, |
| 180 | ): |
| 181 | """ |
| 182 | Compute the multi-dimensional non-normalized (i.e., without z-normalization) matrix |
| 183 | profile with a `ray` cluster |
| 184 | |
| 185 | This is a highly distributed implementation around the Numba JIT-compiled |
| 186 | parallelized `_maamp` function which computes the multi-dimensional matrix |
| 187 | profile according to STOMP. Note that only self-joins are supported. |
| 188 | |
| 189 | Parameters |
| 190 | ---------- |
| 191 | ray_client : client |
| 192 | A `ray` client. Setting up a cluster is beyond the scope of this |
| 193 | library. Please refer to the `ray` documentation. |
| 194 | |
| 195 | T_A : numpy.ndarray |
| 196 | The time series or sequence for which to compute the multi-dimensional |
| 197 | matrix profile. Each row in `T_A` represents data from the same |
| 198 | dimension while each column in `T_A` represents data from a different |
| 199 | dimension. |
| 200 | |
| 201 | T_B : numpy.ndarray |
| 202 | The time series or sequence that will be used to annotate T_A. For every |
| 203 | subsequence in T_A, its nearest neighbor in T_B will be recorded. |
| 204 | |
| 205 | m : int |
| 206 | Window size |
| 207 | |
| 208 | excl_zone : int |
| 209 | The half width for the exclusion zone relative to the current |
| 210 | sliding window |
| 211 | |
| 212 | T_A_subseq_isfinite : numpy.ndarray |
| 213 | A boolean array that indicates whether a subsequence in `T_A` contains a |
| 214 | `np.nan`/`np.inf` value (False) |
| 215 | |
| 216 | T_B_subseq_isfinite : numpy.ndarray |
| 217 | A boolean array that indicates whether a subsequence in `T_B` contains a |
| 218 | `np.nan`/`np.inf` value (False) |
| 219 | |
| 220 | p : float |
| 221 | The p-norm to apply for computing the Minkowski distance. Minkowski distance is |
| 222 | typically used with `p` being 1 or 2, which correspond to the Manhattan distance |
| 223 | and the Euclidean distance, respectively. |
| 224 | |
| 225 | include : numpy.ndarray |
| 226 | A list of (zero-based) indices corresponding to the dimensions in `T` that |
nothing calls this directly
no test coverage detected