|
|
8ea80fca4a
|
revert offline_inference/basic.py
Signed-off-by: Sage Moore <sage@neuralmagic.com>
|
2025-06-02 18:05:48 +00:00 |
|
|
|
21d9529a79
|
revert offline_inference/basic.py
Signed-off-by: Sage Moore <sage@neuralmagic.com>
|
2025-06-02 18:05:26 +00:00 |
|
|
|
ffb740ae95
|
manually manage stream
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
|
2025-05-22 20:51:36 +00:00 |
|
|
|
df8f889f37
|
support MLA
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
|
2025-05-22 20:51:35 +00:00 |
|
|
|
37c9babaa0
|
enable naive microbatching
Signed-off-by: Lucas Wilkinson <lwilkins@redhat.com>
|
2025-05-22 20:51:35 +00:00 |
|
|
|
6ae996a873
|
[Misc] refactor argument parsing in examples (#16635)
Signed-off-by: reidliu41 <reid201711@gmail.com>
Co-authored-by: reidliu41 <reid201711@gmail.com>
|
2025-04-15 08:05:30 +00:00 |
|
|
|
4ec2cee000
|
[Misc] improve example script output (#15528)
Signed-off-by: reidliu41 <reid201711@gmail.com>
Co-authored-by: reidliu41 <reid201711@gmail.com>
|
2025-03-26 10:12:47 +00:00 |
|
|
|
992e5c3d34
|
Merge similar examples in offline_inference into single basic example (#12737)
|
2025-02-20 04:53:51 -08:00 |
|