-
Notifications
You must be signed in to change notification settings - Fork 3.3k
Unsupported param: tools #530
Description
model_name ggml-model-i2_s.gguf
why?
slot print_timing: id 0 | task 0 |
prompt eval time = 11641.76 ms / 291 tokens ( 40.01 ms per token, 25.00 tokens per second)
eval time = 1541.79 ms / 12 tokens ( 128.48 ms per token, 7.78 tokens per second)
total time = 13183.55 ms / 303 tokens
srv update_slots: all slots are idle
request: POST /v1/chat/completions 172.25.218.178 200
terminate called after throwing an instance of 'std::runtime_error'
what(): Unsupported param: tools
Error occurred while running command: Command '['build/bin/llama-server', '-m', 'models/BitNet-b1.58-2B-4T/ggml-model-i2_s.gguf', '-c', '4096', '-t', '5', '-n', '4096', '-ngl', '0', '--temp', '0.8', '--host', '0.0.0.0', '--port', '8000', '-cb', '-p', 'You are a helpful assistant']' died with <Signals.SIGABRT: 6>.