Nox-Lumen MfgNox-Lumen Mfg

Agent settings

Profile → Agents is combo agent’s deepest tuning surface. Plan Mode chooses how to plan; here you choose what to remember, forget, compress, and retrieve.

Concepts: Observability, Multi-tenancy, Architecture.

1. Page overview

Entry: Avatar → My settings → Agent (brain icon).

Six collapse panels

Toolbar:

ButtonRole
Unsaved / SavedStatus lamp — orange when dirty
View JSONDebug / backup
Reset defaultsFactory reset (confirm)
SavePersist for future sessions
Rendering diagram…

These are defaults for all your Agents. Single-Agent overrides exist; Plan Mode per session overrides further. Priority: session Plan Mode > Agent override > global default.

2. Panel ① Compaction

Near context limits the system summarizes older turns to free tokens — key to long chats.

Compaction panel

2.1 compactionMode

ModeDescriptionUse
Standard defaultSummarize older messages; keep last N + summaryDefault daily driver
Safeguard safeguardConservative; triggers closer to limitCompliance / audits
Off offNo compaction — hard truncationDebug only

2.2 Key fields

FieldRangeMeaning
reserveTokensFloor1K–100KMinimum free tokens after compaction
keepLastMessages1–100Recent turns untouched
summaryMaxTokens100–10KSummary cap

2.3 Memory flush

Side effect: compacted turns + summaries can flush into LTM for cross-session recall.

FieldMeaning
Enable flushOn compaction → optional LTM write
softThresholdTokensBegin flush earlier
System promptHow to extract long-term facts
User promptExtra extraction instructions

3. Panel ② Context pruning

Compaction handles history; pruning trims individual tool payloads (e.g., a giant SQL dump).

Pruning panel

3.1 pruningMode

ModeDescription
Off offNo trimming
Adaptive adaptive (default)Soft trim + optional hard clear
Aggressive aggressiveEarlier trims for tight windows

3.2 Key fields

FieldRangeMeaning
keepLastAssistants0–20Recent assistant turns protected
softTrimRatio0–1When soft-trim begins
hardClearRatio0–1Threshold to replace tool output with placeholder
minPrunableToolChars1K–500KSkip tiny returns

3.3 Soft-trim tuning

Keeps head/tail excerpts.

3.4 Hard clear placeholder

When enableHardClear fires:

[Cleared: tool X return (N chars). See Ledger for full output.]

3.5 Tool allow/deny lists

FieldRole
toolsAllowTrim only listed tools (empty = all)
toolsDenyNever trim (e.g. graft_get, ledger_read)

Typical toolsDeny: correctness-critical integrations (doors_query, alm_create_requirement). Leave toolsAllow empty to allow adaptive trimming broadly.

Configure LTM retrieval side (compression writes; this reads).

Memory search panel

4.1 Switches

FieldMeaning
memorySearchEnabledMaster switch
memorySourcesmemory / sessions / final_results

4.2 Chunking LTM excerpts

FieldRange
Chunk tokens50–2000 (default 512)
Overlap0–500 (default 64)

4.3 Sync cadence

FieldMeaning
Sync on session startIncremental pull each open
Sync before searchFreshest / costliest
Interval minutes1–1440
Byte / message deltasBurst triggers

4.4 Query tuning

FieldRange
queryMaxResults1–50
queryMinScore0–1

4.5 Hybrid retrieval

Rendering diagram…

5. Panel ④ Session

Session panel

5.1 sessionScope

ModeMeaningTypical
per-senderUser isolationPersonal assistant
per-agentPer-Agent threadMulti-agent desk
sharedShared conversationTeam handoff

5.2 Program memory

ToggleMeaning
Count runsInvocation stats
Task historystart/end/params
Index finalsPush answers to LTM
Max history1–500 FIFO

5.3 Reset policy

FieldMeaning
resetModeidle timeout vs daily cron
Idle minutesInactivity clears
Daily hourScheduled reset

6. Panel ⑤ Storage

Where LTM physically lives.

Storage panel

6.1 storageDriver

DriverNotes
ElasticsearchProduction default
InfinityLightweight single-node
SQLiteDev / POC

6.2 Index naming

Suggested: combo_ltm_<env>_<tenant>.

6.3 Embedding (LTM)

Separate from KB embeddings.

FieldMeaning
Model namee.g. bge-m3
DimensionsMatch model

7. Panel ⑥ Agent basics

Basics panel

FieldRangeMeaning
workspacepathSandbox root
primaryModelprovider/modele.g. openai/gpt-5
fallbackModelslistFailover chain
contextTokens1K–1MWindow budget
thinkingDefaultlow/medium/highReasoning budget
timeoutSeconds10–3600Hard cap per turn

Thinking depth strongly affects latency/cost — Fast + low vs Full + high.

8. Presets

8.1 Daily writing / Q&A

compactionMode     : default
pruningMode        : adaptive
memorySearchEnabled: true
sessionScope       : per-sender
primaryModel       : openai/gpt-5
thinkingDefault    : medium
contextTokens      : 128000

8.2 Patent / compliance (strict detail)

compactionMode     : safeguard
pruningMode        : off
memorySearchEnabled: true
memorySources      : [memory, sessions, final_results]
primaryModel       : anthropic/claude-4.7-sonnet
thinkingDefault    : high
contextTokens      : 200000
timeoutSeconds     : 600

8.3 Large-scale code review / ASPICE

compactionMode     : default
pruningMode        : aggressive
hardClearRatio     : 0.7
toolsDeny          : [graft_get, alm_create_requirement]
sessionScope       : shared

9. Save / effect

  • Click Save — orange disappears
  • Applies next turn
  • Running jobs unaffected until stopped

10. Next steps

On this page