feat: add mem usage by Cifko · Pull Request #651 · AtomaAI/atoma-node

Cifko · 2025-06-02T09:51:59Z

Add memory usage cap at 0.9

jorgeantonio21

Looking good, but I believe some parts of the logic needs to be refactored

jorgeantonio21 · 2025-06-02T12:26:59Z

 const DEFAULT_MAX_TOKENS: u64 = 8_192;

+/// The ceiling for memory usage, above which the service will not accept new requests
+const MEMORY_USAGE_CEILING: f64 = 0.9;


I reckon these values should be set in the config.toml, as we will most likely need to tweak them

* fix: ensure client errors are correctly tracked (#635) * fix: ensure client errors are correctly tracked * chore: update error tracking * chore: adjust clippy * chore: grammatical error * ci: use stable toolchain (#645) * ci: use stable toolchain * chore: fix clippy issues * revert to use prometheus for queued requests (#646) * revert to use prometheus for queued requests * add start metrics collector * update logs * feat: turn on too many requests for a period of time (#647) * feat: add request running cap (#649) * feat: add request running cap * fix clippy --------- Co-authored-by: Jorge Antonio <matroid@outlook.com> * refactor num running requests for prometheus check * logs * handle deadlock for too many requests timeout trigger check (#650) * feat: add mem usage (#651) * feat: add memusage to get_metrics * add lower threshold for disabling the flag * fix clippy * address 2 comments * add values to config * fix * fix tests * fix name * feat: update sui dependencies (#654) * resolve compilation issues * ci: add caching strategy for ci * ci: optimize coverage job * ci: adjust coverage job * ci: update deny action * ci: use grcov * ci: use stable toolchain * ci: only run tests once * ci: move coverage to test file * ci: use --codecov flag & stable toolchain * ci: discard p2p tester --------- Co-authored-by: chad <chad.nehemiah94@gmail.com> * feat: add max number of queued requests configuration and update request handling (#656) * fix: correct deadlock in `check_if_too_many_requests` (#658) * correct deadlock in check_if_too_many_requests method * resolve tests * add changes * add changes * continue improving logic * add changes * fix: normalize model strings to lowercase in request handlers (#661) * fix: normalize model strings to lowercase in request handlers * fix test * fix --------- Co-authored-by: Chad Nehemiah <chad.nehemiah94@gmail.com> Co-authored-by: Martin Stefcek <35243812+Cifko@users.noreply.github.com>

feat: add memusage to get_metrics

8ddeaeb

Cifko changed the base branch from main to release/v0.1.14 June 2, 2025 09:52

Cifko added 2 commits June 2, 2025 13:45

add lower threshold for disabling the flag

5437d69

fix clippy

65ee08e

jorgeantonio21 requested changes Jun 2, 2025

View reviewed changes

Cifko added 5 commits June 2, 2025 14:35

address 2 comments

a3be3d4

add values to config

6843b92

fix

c47fdcc

fix tests

cef38ce

fix name

86621b0

jorgeantonio21 approved these changes Jun 2, 2025

View reviewed changes

jorgeantonio21 merged commit 6b669a7 into release/v0.1.14 Jun 2, 2025
10 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add mem usage#651

feat: add mem usage#651
jorgeantonio21 merged 8 commits into
release/v0.1.14from
add-mem-usage

Cifko commented Jun 2, 2025

Uh oh!

jorgeantonio21 left a comment

Uh oh!

jorgeantonio21 Jun 2, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Cifko commented Jun 2, 2025

Uh oh!

jorgeantonio21 left a comment

Choose a reason for hiding this comment

Uh oh!

jorgeantonio21 Jun 2, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants