Description
Hello team,
I noticed that the timestamps recorded in episodic_memories currently have quite inconsistent formats. Here are some examples from the data:
- 2026年1月22日11:25:33+00:00
- 2026年1月22日11:25:33+00:00:00
- 2026年1月22日11:27
- 2026年1月22日11:27:55 UTC
- 2026年1月23日02:19:25 UTC
- 2026年1月23日02:19:25+00:00
- 2026年1月23日 (周五) 上午10:07
The variations fall into several categories:
- Chinese vs English date expressions
- With/without weekday (周五, i.e. Friday) and AM/PM marker (上午, i.e. AM)
- With/without timezone (UTC / +00:00 / +00:00:00)
- Different precision levels (missing seconds, etc.)
- Mixing full datetime and partial time
This inconsistency makes later parsing, sorting, filtering, and other time-based operations harder and more error-prone.
From a quick look at the code, it seems the format is mainly controlled by the prompt in episode_mem_prompts.py.
Suggestion
We could strengthen the prompt to enforce a single, standardized format, for example strict ISO 8601:
- Recommended format:
2026-01-23T02:19:25Z
or 2026-01-23T10:07:00+08:00
Example instruction we could add to the prompt:
"Always output timestamps in strict ISO 8601 format with timezone, e.g. 2026-01-23T02:19:25Z. Do not use Chinese characters, weekdays, or any other non-standard format."
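To make "strict ISO 8601 with timezone" checkable, here is a minimal sketch of a validator that could run on the model output. The function name `is_strict_iso8601` is hypothetical and not part of the repo, and the accepted shape (seconds required, no fractional seconds, `Z` or `±HH:MM` offset) is my assumption based on the two recommended examples above.

```python
import re
from datetime import datetime

# Assumed target shape: YYYY-MM-DDTHH:MM:SS followed by "Z" or "+HH:MM"/"-HH:MM".
# re.ASCII keeps \d from matching non-ASCII digits.
_STRICT_ISO = re.compile(
    r"\d{4}-\d{2}-\d{2}T\d{2}:\d{2}:\d{2}(?:Z|[+-]\d{2}:\d{2})", re.ASCII
)


def is_strict_iso8601(value: str) -> bool:
    """Return True if value matches the assumed strict format and is a real date."""
    if _STRICT_ISO.fullmatch(value) is None:
        return False
    # Older Python versions do not accept a trailing "Z" in fromisoformat.
    candidate = value[:-1] + "+00:00" if value.endswith("Z") else value
    try:
        datetime.fromisoformat(candidate)  # rejects e.g. February 30th
    except ValueError:
        return False
    return True
```

A check like this could be used to log or count non-conforming timestamps after the prompt change, which would show how often the model ignores the instruction.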
Questions / Next steps
- Would updating the prompt in episode_mem_prompts.py to enforce a consistent ISO 8601 format be a safe and effective solution?
- Could this change potentially break any existing code that is already tolerant of multiple formats?
- Should we consider adding a lightweight normalization function as a fallback (in case the model sometimes ignores the prompt)?
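For the fallback idea, here is a rough sketch of what a normalization function might look like. It only covers the shapes quoted in this issue plus already-valid ISO strings. `normalize_timestamp` and `default_tz` are hypothetical names, and treating a missing timezone as UTC is an assumption that may be wrong for entries written in local time.

```python
import re
from datetime import datetime, timedelta, timezone

# Chinese date, optional weekday in parentheses, optional 上午/下午 (AM/PM),
# optional seconds, optional zone (UTC, Z, +HH:MM, or the odd +HH:MM:SS).
_CN_TIMESTAMP = re.compile(
    r"(?P<year>\d{4})年(?P<month>\d{1,2})月(?P<day>\d{1,2})日\s*"
    r"(?:[(（][^)）]*[)）]\s*)?"
    r"(?P<ampm>上午|下午)?\s*"
    r"(?P<hour>\d{1,2}):(?P<minute>\d{2})(?::(?P<second>\d{2}))?\s*"
    r"(?P<tz>UTC|Z|[+-]\d{2}:\d{2}(?::\d{2})?)?"
)


def _parse_zone(tz, default_tz):
    """Turn the captured zone text into a tzinfo, or fall back to default_tz."""
    if tz is None:
        return default_tz
    if tz in ("UTC", "Z"):
        return timezone.utc
    sign = 1 if tz[0] == "+" else -1
    fields = [int(part) for part in tz[1:].split(":")]
    # Only hours and minutes are used; a trailing seconds field is ignored.
    return timezone(sign * timedelta(hours=fields[0], minutes=fields[1]))


def normalize_timestamp(raw, default_tz=timezone.utc):
    """Return raw as 'YYYY-MM-DDTHH:MM:SSZ' (UTC); raise ValueError if unparseable."""
    text = raw.strip()
    match = _CN_TIMESTAMP.fullmatch(text)
    if match is None:
        # Not one of the Chinese shapes: try plain ISO 8601 instead.
        candidate = text[:-1] + "+00:00" if text.endswith("Z") else text
        parsed = datetime.fromisoformat(candidate)  # raises ValueError on failure
        if parsed.tzinfo is None:
            parsed = parsed.replace(tzinfo=default_tz)
    else:
        hour = int(match["hour"])
        if match["ampm"] == "下午" and hour < 12:  # PM
            hour += 12
        elif match["ampm"] == "上午" and hour == 12:  # 12 AM is midnight
            hour = 0
        parsed = datetime(
            int(match["year"]), int(match["month"]), int(match["day"]),
            hour, int(match["minute"]), int(match["second"] or 0),
            tzinfo=_parse_zone(match["tz"], default_tz),
        )
    return parsed.astimezone(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")
```

With this sketch, `normalize_timestamp("2026年1月22日11:25:33+00:00:00")` gives `2026-01-22T11:25:33Z`, and passing `default_tz=timezone(timedelta(hours=8))` for `2026年1月23日 (周五) 上午10:07` gives `2026-01-23T02:07:00Z`. Converting everything to UTC is a design choice that keeps sorting simple; preserving the original offset would also be valid ISO 8601.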
Thank you for your time!
