Design `forecaster` package

From @houtanb 

> my plan for this repo (we need buy in for this) is to have another folder, something like `forecaster`. That would have a default forecaster that is the ForecastBench zero shot forecaster, with
> * a complete list of models ever run on FB (like you've started here)
> * the parameters they were run with
> * the zero shot prompt
> so that FRI staff could easily use a FB forecaster in their code.

> But, that `forecaster` library would also:
> * need to obtain structured output to get forecasts in a consistent way or parse LLM output to provide this (as is currently done on FB)
> * handle forecasts on binary questions as with FB, but also quantile forecasts, point forecasts, multiple choice, ...

> This `llm` library is step on which the `forecaster` library can be built. But, as is, it's already useful to peolpe at FRI as you've created this great abstraction from all these different APIs such that anyone at FRI can query any model they want without having to look into a specific API. 

> Until we get that `forecaster` library, the list [in llm/model_regsitry.py] would need to be maintained for not much benefit. Also, the list is not complete as many models have been run that are not present

> FYI `MODELS_TO_RUN` on FB is usually updated at least once every two weeks with new models or pulling in models we've previously run. Just updated today in https://github.com/forecastingresearch/forecastbench/commit/cdc0f9ccbf36109266d06a00f375a1aed36bd491 and will update again when we have access to GPT 5.1 via the API.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Design `forecaster` package #10

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Design forecaster package #10

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

Design `forecaster` package #10