MCPcopy
hub / github.com/InternLM/lmdeploy / cache_max_entry_count

Method cache_max_entry_count

lmdeploy/cli/utils.py:521–528  ·  view source on GitHub ↗

Add argument cache_max_entry_count to parser.

(parser)

Source from the content-addressed store, hash-verified

519
520 @staticmethod
521 def cache_max_entry_count(parser):
522 """Add argument cache_max_entry_count to parser."""
523
524 return parser.add_argument('--cache-max-entry-count',
525 type=float,
526 default=0.8,
527 help='The percentage of free gpu memory occupied by the k/v '
528 'cache, excluding weights ')
529
530 @staticmethod
531 def adapters(parser):

Callers 5

add_parser_chatMethod · 0.80
add_parser_api_serverMethod · 0.80
parse_argsFunction · 0.80
parse_argsFunction · 0.80
parse_argsFunction · 0.80

Calls

no outgoing calls

Tested by

no test coverage detected