1. Reserve an area (1/3 of the screen height) for the timeline.
2. Each time we capture, a smaller version (visualization) of the captured item is shown in the timeline (queued up).
3. (Optional) Make the timeline draggable.
4. Add a Finish button at the very end of the timeline to pack up everything and get it ready to feed into ChatGPT.
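
A minimal sketch of the timeline state described above, written in Python only for illustration (the actual UI framework is not specified here). The names `Capture`, `Timeline`, `add_capture`, and `finish` are hypothetical, and the payload shape returned by `finish` is an assumption, not a format ChatGPT requires. The optional dragging behavior is left out.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Union


@dataclass
class Capture:
    """One captured item; `thumbnail` stands in for its smaller visualization."""
    capture_id: int
    thumbnail: str


@dataclass
class Timeline:
    """Queue of captures shown in the reserved timeline area (hypothetical model)."""
    screen_height: int
    items: List[Capture] = field(default_factory=list)

    @property
    def height(self) -> int:
        # The timeline reserves one third of the screen height (whole pixels).
        return self.screen_height // 3

    def add_capture(self, thumbnail: str) -> Capture:
        # Each new capture is queued at the end of the timeline.
        capture = Capture(capture_id=len(self.items) + 1, thumbnail=thumbnail)
        self.items.append(capture)
        return capture

    def finish(self) -> List[Dict[str, Union[int, str]]]:
        # Pack every queued capture, in capture order, into a plain list
        # that a later step could turn into a ChatGPT request (assumed shape).
        return [{"id": c.capture_id, "thumbnail": c.thumbnail} for c in self.items]


timeline = Timeline(screen_height=900)
timeline.add_capture("capture_1.png")
timeline.add_capture("capture_2.png")
payload = timeline.finish()
```

Keeping the queue separate from the rendering code means the Finish button only has to call one function to collect everything in order.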