-
Notifications
You must be signed in to change notification settings - Fork 9
Fix: Add load/performance/extension test groups and align env/test ex… #22
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: merge-with-upstream
Are you sure you want to change the base?
Fix: Add load/performance/extension test groups and align env/test ex… #22
Conversation
| else | ||
| export JAVA_HOME=$(readlink -f /usr/bin/java | sed 's:/bin/java::') | ||
| fi | ||
| export JAVA_HOME=/usr/lib/jvm/java-8-openjdk |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
delete Auto-detect Java 8 path for different architectures?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If the /usr/lib/jvm/java-8-openjdk/ directory indeed contains JDK versions for multiple architectures, then the JDK should automatically select the appropriate version based on the architecture.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I mean this commit gphd-conf.sh is deleted automatically, set export JAVA_HOME=/usr/lib/jvm/java-8-openjdk. I mean it shouldn't be removed.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ok
| return System.getProperty(AWS_ACCESS_KEY_ID); | ||
| String access = System.getProperty(AWS_ACCESS_KEY_ID); | ||
| String result = access != null ? access : System.getenv(AWS_ACCESS_KEY_ID); | ||
| System.out.println("DEBUG: doInit() method called getAccess " + result); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
this is a debug print?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes, this is a debug print. It should be removed.
| hdfs.getWorkingDirectory() + "/" + fileName); | ||
| // load to hive table | ||
| hive.loadData(tableName, hdfsPath, false); | ||
| hive.loadData(tableName, hdfs.getWorkingDirectory() + "/" + fileName, false); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sometimes HDFS metadata loading isn't that fast. Would it be better to add hdfs.waitForFile? ensure data load successfully?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ok
.github/workflows/pxf-ci.yml
Outdated
| - s3 | ||
| - features | ||
| - gpdb | ||
| - bench |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
bench runs for 5 hours. Not sure that we should run it on every PR.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@tuhaihe bench runs for 5 hours, we need to distinguish between functional testing and performance testing?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
5 hours feels too long for regular PR validation. I don't think it’s necessary to run this on every PR.
It might make more sense to separate performance/benchmark testing from functional testing, and run the benchmark as a scheduled job (for example, weekly) instead of blocking day-to-day PRs.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
So we can remove it from this PR? How about it?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I can try to further split the load tests and performance tests first and see how long each part takes. Then we can decide the best way to run them.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Since the load test takes about 20 minutes and the performance tests take over 2 hours, we could keep just the load test for regular PR validation and move the performance tests to a separate schedule.
bc7f101 to
d474dce
Compare
…pectations - add bench_test and pxf_extension_test in run_tests.sh, plus matrix entries for bench and pxf_extension in CI - bump surefire heap to 4G to avoid OOM - update gpupgrade expected outputs to new PXF_HOME paths and JSON formatter error text - make ProtocolUtils/HiveBaseTest/JdbcHiveTest/OrcWriteTest/ParquetWriteTest more robust to env defaults (protocol, creds, hive JDBC URL) - keep MultiServerTest running under HDFS with a safe working directory fallback - set distribution key and INSERT pattern for performance test data load
d474dce to
17c8ad6
Compare
…pectations
fix #ISSUE_Number
Change logs
Contributor's checklist
Here are some reminders before you submit your pull request: