MCPcopy Index your code
hub / github.com/nebuly-ai/optimate

github.com/nebuly-ai/optimate @v0.9.0 sqlite

repository ↗ · DeepWiki ↗ · release v0.9.0 ↗
1,325 symbols 5,496 edges 184 files 254 documented · 19%
README

Plug and play modules to optimize the performances of your AI systems


Documentation: docs.nebuly.com/


Nebullvm is an ecosystem of plug and play modules to optimize the performances of your AI systems. The optimization modules are stack-agnostic and work with any library. They are designed to be easily integrated into your system, providing a quick and seamless boost to its performance. Simply plug and play to start realizing the benefits of optimized performance right away.

If you like the idea, give us a star to show your support for the project ⭐

What can this help with?

There are multiple modules we actually provide to boost the performances of your AI systems:

✅ Speedster: Automatically apply the best set of SOTA optimization techniques to achieve the maximum inference speed-up on your hardware.

Nos: Automatically maximize the utilization of GPU resources in a Kubernetes cluster through real-time dynamic partitioning and elastic quotas.

ChatLLaMA: Create hyper-personalized ChatGPT-like assistants using your custom data and the least amount of compute possible.

✅ OpenAlphaTensor: Increase the computational performances of an AI model with custom-generated matrix multiplication algorithm fine-tuned for your specific hardware.

✅ Forward-Forward: The Forward Forward algorithm is a method for training deep neural networks that replaces the backpropagation forward and backward passes with two forward passes.

Next modules and roadmap

We are actively working on incorporating the following modules, as requested by members of our community, in upcoming releases:

  • [ ] GPToptimizer: Effortlessly optimize large APIs generative models from OpenAI, Cohere, HF.
  • [ ] CloudSurfer: Automatically discover the optimal cloud configuration and hardware on AWS, GCP and Azure to run your AI models.
  • [ ] OptiMate: Interactive tool guiding savvy users in achieving the best inference performance out of a given model / hardware setup.
  • [ ] TrainingSim: Easily simulate the training of large AI models on a distributed infrastructure to predict training behaviours without actual implementation.

Contributing

As an open source project in a rapidly evolving field, we welcome contributions of all kinds, including new features, improved infrastructure, and better documentation. If you're interested in contributing, please see the linked page for more information on how to get involved.


Join the community | Contribute to the library

Core symbols most depended-on inside this repo

to
called by 212
nebullvm/operations/base.py
append
called by 92
nebullvm/tools/transformations.py
info
called by 51
nebullvm/tools/diffusers.py
append
called by 49
apps/accelerate/chatllama/chatllama/rlhf/utils.py
to_torch_format
called by 48
nebullvm/tools/base.py
run
called by 35
nebullvm/operations/inference_learners/base.py
get
called by 34
nebullvm/tools/feedback_collector.py
execute
called by 34
nebullvm/operations/optimizations/compilers/base.py

Shape

Method 771
Function 309
Class 227
Route 18

Languages

Python100%

Modules by API surface

nebullvm/operations/inference_learners/base.py79 symbols
nebullvm/tools/diffusers.py77 symbols
apps/accelerate/forward_forward/forward_forward/utils/modules.py64 symbols
nebullvm/installers/installers.py38 symbols
apps/accelerate/chatllama/chatllama/llama_model.py36 symbols
nebullvm/operations/inference_learners/tensor_rt.py35 symbols
nebullvm/tools/base.py31 symbols
apps/accelerate/open_alpha_tensor/open_alpha_tensor/core/data/dataset.py27 symbols
nebullvm/tools/transformations.py24 symbols
nebullvm/operations/inference_learners/builders.py22 symbols
apps/accelerate/open_alpha_tensor/open_alpha_tensor/core/modules/heads.py21 symbols
nebullvm/installers/tests/test_install_frameworks.py20 symbols

Dependencies from manifests, versioned

PyYAML6.0 · 1×
Sphinx4.5.0 · 1×
loguru0.5.3 · 1×
nebullvm0.6.1 · 1×
numpy1.21.0 · 1×
packaging21.3 · 1×
psutil5.0.0 · 1×
py-cpuinfo8.0.0 · 1×
requests2.26.1 · 1×
tabulate0.8.0 · 1×
torch1.9 · 1×
torchvision0.10 · 1×

For agents

$ claude mcp add optimate \
  -- python -m otcore.mcp_server <graph>

⬇ download graph artifact