
Conversation

@pavelkomarov

The tests are definitely going to fail, because I identified at least one bug that changes core logic, so the results will be a tiny bit numerically different. We'll check against improved unit tests once that's merged. This also addresses #97 some, but I'm afraid that one will take significantly more work if we want to actually properly extend this module. Even the linear KF in this module isn't calculating a derivative (mistake?), which makes me extra suspicious that it hasn't had any love in a long time.

- **x_hat** -- estimated (smoothed) x
- **dxdt_hat** -- estimated derivative of x
"""
w = np.arange(len(x)) / (len(x) - 1) # set up weights, [0., ... 1.0]

pavelkomarov commented

There's an easier way.

####################


def __kalman_forward_update__(xhat_fm, P_fm, y, u, A, B, C, R, Q):

pavelkomarov commented Jun 21, 2025

I found making this a separate function to be unnecessary; its logic now lives in the forward filter.

else:
    xhat_fp = xhat_fm
P_fp = (I - K_f@C)@P_fm
xhat_fm = A@xhat_fp + B@u

pavelkomarov commented

Some duplicated lines in here. I got rid of them.

:return:
"""
I = np.array(np.eye(A.shape[0]))
gammaW = np.array(np.eye(A.shape[0]))

pavelkomarov commented

What is this supposed to do? It was doing nothing, so I got rid of it.

- **xhat_pre** -- a priori estimates of xhat, with axis=0 the batch dimension, so xhat[n] gets the nth step
- **xhat_post** -- a posteriori estimates of xhat
- **P_pre** -- a priori estimates of P
- **P_post** -- a posteriori estimates of P

pavelkomarov commented

I've renamed all these to be a little more explicit about what they are.

:param np.array u: optional control input
:param np.array B: optional control matrix
:return: tuple[np.array, np.array, np.array, np.array] of\n
- **xhat_pre** -- a priori estimates of xhat, with axis=0 the batch dimension, so xhat[n] gets the nth step

pavelkomarov commented

In many places you weren't using 0 as the batch dimension. But axis 0 should always be the batch dimension, so you can pull out a sample without having to do complicated slicing.

xhat_fp = xhat_fm + K_f@(y - C@xhat_fm)
P_fp = (I - K_f@C)@P_fm
xhat_fm = A@xhat_fp + B@u
P_fm = A@P_fp@A.T + gammaW@Q@gammaW.T

pavelkomarov commented

The order of these calls is inverted from how I tend to think about the KF. You can start on either the prediction or the update side, but I start with prediction in the paper, and so does the standard Wikipedia pseudocode, as do many other pseudocodes for the KF. Thus, your _fm variables here contain the next time step's prediction, but they're indexed at this time step, which causes confusion when you get to your RTS implementation.
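For reference, here's a minimal prediction-first sketch (the function name, array layout, and assumed shape of y are mine, not the module's exact API), in which the a priori and a posteriori quantities for step t are both stored at index t, so the later RTS pass needs no index gymnastics:

```python
import numpy as np

def _kalman_forward_filter(xhat0, P0, y, A, C, R, Q):
    """Prediction-first forward pass (sketch). xhat_pre[t]/P_pre[t] are the a priori
    estimates for step t; xhat_post[t]/P_post[t] are the a posteriori ones."""
    N = y.shape[1]                                     # assume y has shape (measurement_dim, N)
    n = A.shape[0]
    xhat_pre, xhat_post = np.zeros((N, n)), np.zeros((N, n))
    P_pre, P_post = np.zeros((N, n, n)), np.zeros((N, n, n))
    xhat, P = xhat0, P0                                # treated as last step's posterior
    for t in range(N):
        xhat = A @ xhat                                # predict forward to step t
        P = A @ P @ A.T + Q
        xhat_pre[t], P_pre[t] = xhat, P
        K = P @ C.T @ np.linalg.pinv(C @ P @ C.T + R)  # then update with measurement t
        xhat = xhat + K @ (y[:, t] - C @ xhat)
        P = (np.eye(n) - K @ C) @ P
        xhat_post[t], P_post[t] = xhat, P
    return xhat_pre, xhat_post, P_pre, P_post
```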

xhat_smooth = copy.copy(xhat_fp)
P_smooth = copy.copy(P_fp)
for t in range(N-2, -1, -1):
    L = P_fp[t]@A.T@np.linalg.pinv(P_fm[t])

pavelkomarov commented

So this line is actually correct, because P_fm[t] holds $P_{t+1|t}$.

for t in range(N-2, -1, -1):
    L = P_fp[t]@A.T@np.linalg.pinv(P_fm[t])
    xhat_smooth[:, [t]] = xhat_fp[:, [t]] + L@(xhat_smooth[:, [t+1]] - xhat_fm[:, [t+1]])
    P_smooth[t] = P_fp[t] - L@(P_smooth[t+1] - P_fm[t+1])

pavelkomarov commented Jun 21, 2025

But this line is incorrect, because you're supposed to use $P_{t+1|t}$ here as well, which is stored in P_fm[t]. https://arc.aiaa.org/doi/10.2514/3.3166

This line is also wrong because you're subtracting the L part instead of adding it. So this isn't doing RTS smoothing at all. Serious bug.
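For reference, the standard Rauch-Tung-Striebel recursion, written in the aligned-index convention of the sketch above (so P_pre[t+1] holds $P_{t+1|t}$), adds the correction term and sandwiches it with L:

```python
import numpy as np

def _rts_smooth(xhat_pre, xhat_post, P_pre, P_post, A):
    """Standard RTS backward pass (sketch; names are assumptions)."""
    xhat_smooth, P_smooth = xhat_post.copy(), P_post.copy()
    N = len(xhat_post)
    for t in range(N - 2, -1, -1):
        L = P_post[t] @ A.T @ np.linalg.pinv(P_pre[t + 1])
        xhat_smooth[t] = xhat_post[t] + L @ (xhat_smooth[t + 1] - xhat_pre[t + 1])
        P_smooth[t] = P_post[t] + L @ (P_smooth[t + 1] - P_pre[t + 1]) @ L.T
    return xhat_smooth, P_smooth
```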

#####################


def __constant_velocity__(x, dt, params, options=None):

pavelkomarov commented Jun 21, 2025

You don't need all these shadow functions behind the front-facing ones, because you should really only define the matrices once and then call smoothing twice. As written, the matrices were getting redefined for the forward and backward passes separately, which is extra work. So the two levels got combined across all these methods.

#########################################
# Constant 1st, 2nd, and 3rd derivative #
#########################################
def _constant_derivative(x, P0, A, C, R, Q, forwardbackward):

pavelkomarov commented

So so so so much code in this module was just copy and pasted, and the structure to call things either forward or forward and backward was repeated unnecessarily, so I've put that logic in a single place.
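A rough sketch of what that single place could look like (my illustration, not the PR's exact code: `_kalman_smooth` stands in for the forward filter plus RTS pass, and the sign flip on the backward-pass derivative is an assumption about how the time reversal is handled):

```python
import numpy as np

def _constant_derivative(x, P0, A, C, R, Q, forwardbackward=True):
    """Shared driver (sketch): smooth forward, optionally smooth the reversed signal
    under inverted dynamics too, then blend the two passes."""
    xhat_smooth = _kalman_smooth(x, P0, A, C, R, Q)             # assumed helper: KF + RTS
    x_hat_f, dxdt_hat_f = xhat_smooth[:, 0], xhat_smooth[:, 1]
    if not forwardbackward:
        return x_hat_f, dxdt_hat_f
    xhat_smooth_b = _kalman_smooth(x[::-1], P0, np.linalg.inv(A), C, R, Q)
    x_hat_b = xhat_smooth_b[::-1, 0]
    dxdt_hat_b = -xhat_smooth_b[::-1, 1]                        # derivative flips sign in reversed time
    w = np.arange(len(x)) / (len(x) - 1)                        # weights ramp from 0 to 1
    return x_hat_f*w + x_hat_b*(1-w), dxdt_hat_f*w + dxdt_hat_b*(1-w)
```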

x_hat_forward = xhat_smooth[:, 0] # axis 0 is time, so this takes the first state component at every time step
dxdt_hat_forward = xhat_smooth[:, 1]

if not forwardbackward: # bail out here if not doing the same in reverse and then combining

pavelkomarov commented

It's very odd to me that the Kalman filter with RTS smoothing already works forward and backward, but then you invert the dynamics and go again and then take a weighting of the answers. Why? I thought RTS smoothing was already supposed to be optimal. Did it not seem to work at one point, and were these further hacks? I wonder whether fixing the bugs in the RTS smoother will make the hacks unnecessary.

pavelkomarov commented Jun 26, 2025

The answer is that a non-optimal guess to kick off the process biases the end you start on, and it takes a while for that bias to wear off. Classic RTS (even sans bugs) doesn't fully address this, but running two passes and weighting them together does.

Ideally, you could choose an initial x0, P0 that wouldn't horribly bias the process, but because $C$ is not full rank, there is no way to choose the single optimal x0, P0. See my ponderings on #97.

x_hat = x_hat_forward*w + x_hat_backward*(1-w)
dxdt_hat = dxdt_hat_forward*w + dxdt_hat_backward*(1-w)

dxdt_hat_corrected = np.mean((dxdt_hat, dxdt_hat_forward), axis=0) # What is this line for?

pavelkomarov commented

This strikes me as a hack. I don't like it.

:type params: dict {'backward': boolean}, optional

:return: a tuple consisting of:
def constant_acceleration(x, dt, params=None, options=None, r=None, q=None, forwardbackward=True):

pavelkomarov commented Jun 21, 2025

Unfortunately, these function signatures repeat a bunch of docstring and input-checking logic, but I can't simplify further without breaking backwards compatibility. I'd prefer a single constant_derivative function that takes order as a parameter.
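For the record, a hypothetical unified front end might look something like this (the name, defaults, and process-noise choice are all my assumptions; this is not part of the PR):

```python
import math
import numpy as np

def constant_derivative(x, dt, order=2, r=1e-4, q=1e-2, forwardbackward=True):
    """Model the signal as having a constant `order`-th derivative, then hand off
    to the shared _constant_derivative driver (hypothetical front end)."""
    n = order + 1                                   # state: x and its first `order` derivatives
    A = np.zeros((n, n))                            # exact discretization of the integrator chain:
    for i in range(n):                              # A[i, j] = dt^(j-i) / (j-i)!
        for j in range(i, n):
            A[i, j] = dt**(j - i) / math.factorial(j - i)
    C = np.zeros((1, n)); C[0, 0] = 1               # only the position is measured
    R = np.array([[r]])
    Q = q * np.eye(n)                               # simple process-noise choice (assumption)
    return _constant_derivative(x, np.eye(n), A, C, R, Q, forwardbackward)
```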

:return: matrix:
- xhat_smooth: smoothed estimates of the full state x
:rtype: tuple -> (np.array, np.array)

pavelkomarov commented

The docstring indicates, here and in its summary line, by use of the words "to estimate the derivative", that this function should be calculating a derivative, but it's not. This raises the question: does anybody use this, since the omission went unnoticed? Can I break backwards compatibility for this function alone?



###################################################################################################
# Constant Acceleration with Savitzky-Golay pre-estimate (not worth the parameter tuning trouble) #

pavelkomarov commented Jun 21, 2025

If this is true, then we should nuke this code per #95. If we ever wish to recover the experiment, we can go back to an ancient tagged commit and fish it out of the nether.

if len(x.shape) == 2:
    pass
else:
    x = np.reshape(x, [1, len(x)])

pavelkomarov commented

It's not necessary to reshape 1D vectors into vertical 2D column vectors, because A @ x will do the exact same thing in either case.
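A quick illustration of that point (arbitrary numbers, just to show the shapes):

```python
import numpy as np

A = np.array([[1., 0.5],
              [0., 1. ]])
x = np.array([2., 3.])
print(A @ x)                   # 1D in, 1D out: [3.5 3. ]
print(A @ x.reshape(-1, 1))    # column in, column out: same numbers, shape (2, 1)
```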


xhat_fp = None
P_fp = []
P_fm = [P_fm]

pavelkomarov commented Jun 21, 2025

This initialization is slightly odd, because it says x0, P0 are the a priori estimate for the current time based on the previous time, rather than considering them the a posteriori estimate from last time. The difference is that last time's a posteriori estimate would be propagated forward before projection through $C$ to produce an expected measurement for combination with the first measurement, whereas an a priori estimate already associated with the current time would simply be projected through $C$.

What makes more sense when we ask the user for an initial condition? "Tell us where you think the system is, one time step before seeing any measurements" or "Tell us where you think the system is at the moment of the first measurement"? Six of one, half a dozen of the other. I spent a good while in #97 trying to think of a good way to initialize the cycle, but any projection from measurement space to full state space requires the measurement space to have the same dimension as the state space, which just isn't the case for the constant-derivative models, where the dimension of x goes up to the $\nu^\text{th}$ derivative but the dimension of y is always 1.

In the end I decided I prefer to project our identity matrix P0 guess through $A$ and then add $Q$ before projecting through $C$ to get the expected covariance, if only because it agrees with what I've written in the paper and most explanations I've ever seen.
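Concretely, with x0 and P0 treated as last step's posterior, the first cycle computes $\hat{x}_{0|-1} = A x_0$ and $P_{0|-1} = A P_0 A^\top + Q$, and the expected measurement covariance used against the first sample is $C P_{0|-1} C^\top + R$.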

pavelkomarov commented

If we end up preferring the opposite, it's easy to change _kalman_forward_filter.

:param Q:
:return:
if B is None: B = np.zeros((A.shape[0], 1))
if u is None: u = np.zeros(B.shape[1])

pavelkomarov commented

u can safely be 1D.

pavelkomarov changed the title from "addressing #68 for the kalman module, discovering bugs in the process" to "addressing #68 and #69 for the kalman module, discovering bugs in the process" Jun 23, 2025
pavelkomarov changed the title from "addressing #68 and #69 for the kalman module, discovering bugs in the process" to "addressing #68, #69, and #71 for the kalman module, discovering bugs in the process" Jun 23, 2025
elif r == None or q == None:
    raise ValueError("`q` and `r` must be given.")

A = np.array([[1, dt, (dt**2)/2], # states are x, x', x"

pavelkomarov commented

I'm using a more exact A here, per #96.
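For context, that matrix is exactly what the continuous-time constant-acceleration model discretizes to; a quick sanity check (assuming SciPy is available):

```python
import numpy as np
from scipy.linalg import expm

dt = 0.01
F = np.diag([1., 1.], k=1)     # continuous dynamics: d/dt [x, x', x"] = F @ [x, x', x"]
A_exact = expm(F * dt)         # exact one-step discretization
print(np.allclose(A_exact, [[1, dt, dt**2/2],
                            [0,  1, dt     ],
                            [0,  0, 1      ]]))   # True
```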

pavelkomarov changed the title from "addressing #68, #69, and #71 for the kalman module, discovering bugs in the process" to "addressing #68, #69, #71, #95, and #96 for the kalman module, discovering bugs in the process" Jun 23, 2025
pavelkomarov changed the title from "addressing #68, #69, #71, #95, and #96 for the kalman module, discovering bugs in the process" to "addressing #68, #69, #71, #95, and #96 for the kalman module" Jun 25, 2025

pavelkomarov commented Jun 25, 2025

Okay, reporting in about the "correction" happening in the Kalman smooth functions, which I thought shouldn't be there and was maybe the result of a hack to compensate for an RTS smoothing bug.

Here is constant_velocity on the first two test functions with the "correction" still in my code:
[screenshot: constant_velocity results with the correction]

The blue line is the function, and the yellow line is its derivative.

And here is the performance if I remove the correction and just stick to weighting the forward and backward passes of RTS:
[screenshot: constant_velocity results without the correction]

And just for completeness, here is the result again without the correction, but when I set forwardbackward=False:
[screenshot: constant_velocity results without the correction, forwardbackward=False]

I'll compare performance for constant_acceleration and constant_jerk as well to see if this pattern holds.

@pavelkomarov

Here is the full plot for constant_acceleration with the "correction":
[screenshot: constant_acceleration results with the correction]

And here is without the correction:
[screenshot: constant_acceleration results without the correction]

Without a doubt, the version without the correction is better. I'm going to remove the correction in this PR.

pavelkomarov commented Jun 25, 2025

This one now makes use of the changes from #102, so order of operations says that one should be reviewed and merged first. Then we can git pull origin master to this branch to rebase and exclude those changes from this review.

pavelkomarov changed the base branch from master to smooth-fd-docstrings June 26, 2025 23:57
pavelkomarov mentioned this pull request Jun 27, 2025
pavelkomarov changed the base branch from smooth-fd-docstrings to master June 27, 2025 18:25
pavelkomarov merged commit e7026bd into master Jun 27, 2025
1 check passed
This was referenced Jun 27, 2025
pavelkomarov deleted the kalman-docstrings-and-doublecheck branch June 27, 2025 21:13