Add Pythonx.install_env/0 and Pythonx.install_paths/0 to enable FLAME #35

Conversation
---

Actually, I'm not so sure about the initializing-once part. Especially with FLAME, it can be that you reuse the FLAME runner but want to initialize a different pyproject.toml / venv than the last user of the runner. Currently, if I understand correctly, that is not possible. Initializing Python and initializing uv seem to be very entangled, right?
---

Hey, sorry for the late reply. Another approach that could work is for Livebook to set an application env with the pyproject.toml, then when Pythonx boots on another node and the env is set, it would do the initialization. We already support such an application env, but it is compile-time (if we don't set the env, that code path doesn't exist at all): see `pythonx/lib/pythonx/application.ex`, lines 30 to 34 at 57b5398.
cc @josevalim
Initializing Pythonx itself means starting the interpreter and setting up code paths. uv is a way to get Python+dependencies. So those are distinct, but since currently we only support initializing Pythonx with uv-provided Python+dependencies, they are entangled. Within a single node, Pythonx can only be initialized once. Theoretically there is a way to uninitialize the interpreter, but in practice this doesn't work, because many libraries with native code would break (e.g. numpy).
---

Thanks for the reply and don't worry about the delay!

Okay, yes, this could also be built into Pythonx of course. Either way, in the case of FLAME, passing the env variable would have to be a manual configuration on the FLAME pool...

Okay, if I understand correctly, one FLAME runner / BEAM node can only be used for a single Python project config (i.e. venv, i.e. Python+dependencies)? I mean... this limitation is probably okay if we can make sure Pythonx is actually initialized on the FLAME. The thing that's bothering me a bit is: env variables are configured on a FLAME pool, but the limitation about Pythonx actually applies to a runner, not a pool. Each fresh runner of the same pool could technically be initialized with a different pyproject.toml. But... if we have to pass pyproject.toml as an env var, we have to configure it on the pool... Then again... maybe that's only a theoretical problem and in practice, when using it with Livebook, you'd use one FLAME pool for one pyproject.toml...
---

@mruoss to be clear, I mean the pythonx application env, not an env var. The application envs are automatically copied by FLAME to the runner :)
---

Oooh I see! Yes this sounds like a good approach! I can play around with this a bit.
---

Hmm... maybe I'm still not getting it. Aren't we talking about the following?

```elixir
Mix.install([{:pythonx, "~> 0.4.2"}, {:flame, "~> 0.1.5"}, {:flame_k8s_backend, "~> 0.5"}])

# ... initialize FLAME pool ...

Application.put_env(:pythonx, :pyproject_toml, "just a test")

FLAME.call(:runner, fn -> Application.get_env(:pythonx, :pyproject_toml) end)
# returns nil, was expecting "just a test"
```
---

I think we copy the .app files but not the runtime values (but I may be misremembering). So anything dynamic won't work indeed.
---

Gah, sorry, I thought the persistent config is copied too but I misremembered.
---

But I think we do need to pass the information for boot in some way. @josevalim perhaps we can be explicit and pass extra config on the FLAME pool to set extra application env?
---

@jonatanklosko another option is to pass Pythonx as additional paths to be copied to FLAME and, because we should cache everything, then it just works on the FLAME?
---

@josevalim that's a separate point. Ideally we want to copy it to effectively get a cache hit, but the question is how pythonx on the new node knows that it should initialize the interpreter.
---

It would be great if we could store it in the directory we copy, but I am afraid it is not straightforward. We could also look at system env vars, but I don't think they are copied either?
---

Or we could copy to priv, which may have other side effects, so perhaps yeah, we need new capabilities in FLAME.

Env vars that we set dynamically are not copied. We could pass it explicitly to the pool via …

You mean the cache is still somewhere global, but FLAME copies it to the Pythonx priv on the new node? It's even more implicit, since Pythonx would need to infer if/how it should initialize based on the priv contents. We also already use … So currently I am leaning towards adding application env config to FLAME, and perhaps a function like …
---

Right, so either app env or system env is fine. FLAME already supports system env, but adding app env should be trivial.
---

@josevalim actually, in both cases we can make it opaque if we don't expose the key name:

```elixir
[
  env: ... |> Map.merge(Pythonx.flame_env())
]

# vs

[
  config: [
    pythonx: Pythonx.flame_config()
  ]
]
```

Then we can have a very verbose env var name with encoded state and it's all private API. Then I am fine with using env vars.
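A minimal sketch of what such an opaque env var could look like. Everything here is hypothetical, not the actual Pythonx API: the module, function names, and env var name are made up, and the "encoded state" is assumed to be just the pyproject.toml content, Base64-encoded so it travels as a single env var:

```elixir
defmodule PythonxFlameSketch do
  # Hypothetical: a single verbose, private env var name that carries
  # the encoded init state from the parent node to the FLAME runner.
  @env_name "PYTHONX_FLAME_INIT_STATE"

  # Called on the parent node; returns a map suitable for merging
  # into the FLAME pool's :env option.
  def flame_env(pyproject_toml) do
    %{@env_name => Base.encode64(pyproject_toml)}
  end

  # Called when Pythonx boots on the runner node: if the env var is
  # set, decode it and signal that initialization should happen.
  def maybe_init_from_env do
    case System.get_env(@env_name) do
      nil -> :noop
      encoded -> {:init, Base.decode64!(encoded)}
    end
  end
end
```

Since the env var name never appears in user code, it stays a private implementation detail shared between the two sides.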
---

As you prefer!
---

@mruoss so in case you want to try the alternative approach, the idea is to have …
---

That makes sense and I can definitely try this approach. But where would Pythonx keep this state? I.e. where would …
---
We can store in persistent term upon initialization :) |
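One way to sketch "store in persistent term upon initialization". The module name and key are hypothetical (not the actual Pythonx implementation), and the stored state is assumed to be just the pyproject.toml content:

```elixir
defmodule PythonxInitStateSketch do
  # Hypothetical sketch: record what Pythonx was initialized with in a
  # persistent term, so later calls can read it back cheaply.
  @key {:pythonx, :init_state}

  def record(pyproject_toml) do
    :persistent_term.put(@key, %{pyproject_toml: pyproject_toml})
  end

  def fetch do
    :persistent_term.get(@key, nil)
  end

  def initialized?, do: fetch() != nil
end
```

`:persistent_term` is a good fit because the state is written once at initialization and read many times afterwards, and reads are cheap and copy-free.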
---

The approach in this pull request defines init_state by the paths rather than the … I pushed some changes to explain the approach. There are 2 exported functions on this branch now: … Does that make sense?
---

What if we call it …?
---

I will leave the decision about function naming up to you guys. With the current version of this PR, I can successfully launch a FLAME and initialize Pythonx on it. Note that the wildcard part is only needed until a new version of FLAME is released with phoenixframework/flame@6d62b8a.

Once we're aligned about the naming(s), the next step would be to adapt Livebook's … Once that is done, users still need to manually pass the paths and env to FLAME. Using the k8s backend, this would look something like the following in a Livebook:

```elixir
import YamlElixir.Sigil

pythonx_env = Pythonx.install_env()

pod_template = ~y"""
apiVersion: v1
kind: Pod
metadata:
  generateName: livebook-flame-runner-
  namespace: livebook
spec:
  containers:
    - name: livebook-runtime
      env:
        - name: LIVEBOOK_COOKIE
          value: #{Node.get_cookie()}
        - name: #{pythonx_env.name}
          value: #{pythonx_env.value}
"""

Kino.start_child(
  {FLAME.Pool,
   name: :runner,
   code_sync: [
     start_apps: true,
     copy_apps: true,
     sync_beams: Kino.beam_paths(),
     # <= this is where we copy the paths / cache over to the FLAME
     copy_paths: Pythonx.install_paths()
   ],
   backend: {FLAMEK8sBackend, runner_pod_tpl: pod_template}}
  # other options
)
```
@jonatanklosko Oh of course, this makes sense! I've pushed some changes now and it works like a charm. Now I don't have to init Pythonx inside the `FLAME.call()` callback anymore and I can re-use the same runner for multiple consecutive calls (as long as the pyproject.toml doesn't change). 🎉

@josevalim I agree, it would be nice if this didn't have to be configured on the backend (k8s pod template in my example). But I'm not sure I fully get you. Are you saying you'd like both the path and env sync to happen without the need for the user to specify anything on either the pool or the backend?
---

I am saying it should only be specified at the FLAME level. But we need a good name for it, because the variable is only set after the Erlang runtime boots, unless we add a new responsibility to the FLAME backends. We will have to play with it from the FLAME side!
---

Okay yes, I see your point. Maybe we're abusing system env here. This is FLAME-specific, so maybe it can be a bit more "custom tailored", no? What if FLAME provides an API similar to Agent... something built on top of …
---

Honestly, those are all the same. I think we will only know the answer for sure once we play with the FLAME API. It may be that the system environment makes the most sense, as long as we allow the backend to set it before the machine starts; then it can be used for other use cases. But we need to consult with the backends. I assume for k8s that would be straightforward?
---

Haha, yeah, I had the same thought after I wrote it (all of them being the same). So let me play with FLAME a bit when I get to it.

You mean passing all (or a set of) system env vars? That's pretty straightforward, yes.
---

@mruoss an … So we just need the same for k8s?
---

Oh well, if this is what you're talking about: any FLAME backend that wants to support Livebook already needs support for defining env variables on the runner "machine". Otherwise you couldn't pass … The k8s backend supports 2 ways of defining the Pod manifest for the FLAME runner (see the detailed docs on hexdocs): …
So I think it is all good. Worst case scenario we will need to pass the manifest and …

---

Again, I'm not sure I understand. With the k8s backend it's an either/or. You either use the abstraction and pass …
---

@mruoss yes, I understand that. All I am saying is that it would be nice to do: … So we are not polluting the pod template with Pythonx or Livebook vars, and the user can focus on their stuff. That's what I meant by a unified env var. It is not the end of the world though; if we can't have both, that's fine, and …
---

jonatanklosko left a comment:
Awesome! I dropped some comments regarding the API, and then we can ship it.
---

@josevalim I understand now. And I'm torn between "yes that would be nice indeed" and "but then we define env vars in 2 places, we need to clarify precedence, ...". But I'll have to give it some more thought.

@jonatanklosko Thanks for the code review. All makes sense. Not 100% sure about the root_dir. In my test it now takes quite long to create the FLAME pool because the list of files is very long.
---

@mruoss interesting, …
---

Sorry, something got messed up in my previous commit. Now it should reflect the correct changes.

Hmm, yes, that's true. Still, do we really need to copy the entire …

The headers presumably not, since if anything compiles on the host, it is then reused on the nodes. But the headers are small compared to lib/, which is most of the space; it has dynamic libraries and those we should likely keep as is in the general case.
---

jonatanklosko left a comment:
Mostly docs updates, but otherwise looks good to me!
---

jonatanklosko left a comment:
Fantastic, looks good to me!!
---

Flame v0.5.3 is out :)
---

This is a proposal / food for discussion at this point, coming over from the discussion here: https://elixirforum.com/t/is-there-a-way-to-access-pyproject-toml/73129.

In order to support Livebook+Pythonx+FLAME, something like this would be a first step. We'd still have to somehow get that struct/data over to the FLAME and initialize Pythonx in Livebook's `/rel/server/overlays/bin/start_flame.exs`. Since this can only be done once, we can't do it inside the `FLAME.call` function as FLAMEs can be reused... WDYT?