-
Notifications
You must be signed in to change notification settings - Fork 20
Add btest-diff-cut
#125
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add btest-diff-cut
#125
Conversation
bc8846f to
76cc235
Compare
3ac6a15 to
96bc2e5
Compare
96bc2e5 to
c8bd835
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Main feedback is for removing the -f requirement and naming the env variable TEST_DIFF_CUTTER so it looks good when used together with TEST_DIFF_CANONIFIER in one line.
The other comments are pretty nitty: EAFP and naming.
c8bd835 to
5604d67
Compare
5604d67 to
703680b
Compare
|
Very nitty: The commit message mentiones |
a6ec62f to
95bc091
Compare
awelzel
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, but should maybe have a look from @rsmmr or @ckreibich . I think this will help a lot. Thanks for doing the work!
This is basically _what if `zeek-cut` and `btest-diff` had a baby_. The intended use case is to give users an easy to use tool which helps avoid baselining full Zeek logs.
I also fixed deprecation warnings around how we specify the license in `pyproject.toml`.
95bc091 to
975789f
Compare
|
I think it would be better for this to live inside the Zeek tree. The use case is very Zeek-specific, and then we cut just hardcode it for what it's needed there (i.e., using zeek-cut). |
As written this is actually more generic than the name |
|
Looking at this again I have a question — take this example from one of our btests: The idea is that we'd save the detour through the .cut file as follows, yes? If so, this seems very similar to saying so I wonder whether this is worth it. If you guys think so, it's fine with me, but then I'm inclined to agree with Robin to make this fully about zeek-cut, which seems will likely be the bulk of the use. |
Yes, exactly that. The only real difference is that this script would allow putting
The real question is why even with this available users are still tripped up by us adding columns to logs since their tests baseline arbitrary logs. The idea here was to make being explicit about "interesting data" (and by extension: the expected schema) more "natural" so users are nudged towards writing more robust tests. I will close this, but we should probably still think about how to address this (more documentation, updated templates, ...?). |
😞 The usage in Zeek or packages would be more like this: And yes, I think the reduction in noise makes a difference if used all over the place - particularly if you want a second cannonifier and pipes. I'd actually rather have users come up with their own helpers than us recommending the second way. |
This is basically what if
zeek-cutandbtest-diffhad a baby. The intended use case is to give users an easy to use tool which helps avoid baselining full Zeek logs.