File train_mem.py

fastchat/train/train_mem.py:None–None · view source on GitHub ↗

Source from the content-addressed store, hash-verified

1	# Make it more memory efficient by monkey patching the LLaMA model with FlashAttn.
2
3	# Need to call this before importing transformers.
4	from fastchat.train.llama2_flash_attn_monkey_patch import (

nothing calls this directly

replace_llama_attn_with_flash_attnFunction · 0.90

trainFunction · 0.90

no test coverage detected