Let correlation endpoint accept only one dataset by asizemore · Pull Request #93 · VEuPathDB/service-eda

asizemore · 2025-09-22T21:30:39Z

Resolves #92

For WGCNA, we'd like the user to be able to run correlation on one dataset (self-correlation) or on two datasets. Currently we have a separate endpoint for self-correlation.

This PR does a few things:

Renames hasSecondCollection to hasTwoCollections
Removes the restrictions against data2 being null and against data2 == data1
Allows self-correlation to be requested either by passing data1 == data2 or by passing data1 alone with data2 = null.
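
To make the intended behavior concrete, here is a minimal, language-neutral sketch in Python of how the two request shapes could be treated as equivalent. It is not the code in this PR: the function name `is_self_correlation` is hypothetical, and comparing the two specs by deep equality is an assumption about how "data1 == data2" would be decided.

```python
from typing import Any, Mapping, Optional


def is_self_correlation(config: Mapping[str, Any]) -> bool:
    """Return True when the config asks to correlate data1 with itself.

    Hypothetical helper for illustration only; not part of service-eda.
    """
    data1 = config["data1"]
    data2: Optional[Mapping[str, Any]] = config.get("data2")

    # Shape 1: data2 is omitted or explicitly null.
    if data2 is None:
        return True

    # Shape 2: data2 describes exactly the same input as data1.
    # Assumption: deep equality of the two specs is a sufficient check.
    return data2 == data1
```

With a check like this, both request shapes can be routed through the same self-correlation path, which is why keeping both options costs little.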

It looks like the frontend will be sending data2 == data1. Still, both that approach and data2 = null work and seem like reasonable ways to use the endpoint, so I'm partial to keeping both options but could be persuaded otherwise!

To test, use eda-inc. Here's an example request with data2 omitted:

{
	"config": {
		"prefilterThresholds": {
			"proportionNonZero": 0.05,
			"variance": 0,
			"standardDeviation": 0
		},
		"data1": {
			"dataType": "collection",
			"collectionSpec": {
				"entityId": "EUPATH_0000813",
				"collectionId": "EUPATH_0009252"
			}
		},
		"correlationMethod": "spearman"
	},
	"derivedVariables": [],
	"filters": [],
	"studyId": "Bangladesh_healthy_5yr-1"
}

To test data1 == data2, use this request:

{
	"config": {
		"prefilterThresholds": {
			"proportionNonZero": 0.05,
			"variance": 0,
			"standardDeviation": 0
		},
		"data1": {
			"dataType": "collection",
			"collectionSpec": {
				"entityId": "EUPATH_0000813",
				"collectionId": "EUPATH_0009252"
			}
		},
		"data2": {
			"dataType": "collection",
			"collectionSpec": {
				"entityId": "EUPATH_0000813",
				"collectionId": "EUPATH_0009252"
			}
		},
		"correlationMethod": "spearman"
	},
	"derivedVariables": [],
	"filters": [],
	"studyId": "Bangladesh_healthy_5yr-1"
}
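
For anyone who wants to generate both test bodies rather than copy them by hand, a small Python sketch follows. The helper names `collection_spec` and `build_correlation_request` are hypothetical, and the endpoint URL is deliberately left out because it isn't given here; the sketch only builds and serializes the JSON bodies shown above.

```python
import copy
import json
from typing import Any, Dict


def collection_spec(entity_id: str, collection_id: str) -> Dict[str, Any]:
    """Build a data1/data2 entry of type "collection"."""
    return {
        "dataType": "collection",
        "collectionSpec": {"entityId": entity_id, "collectionId": collection_id},
    }


def build_correlation_request(
    study_id: str,
    data1: Dict[str, Any],
    include_data2: bool = False,
    method: str = "spearman",
) -> Dict[str, Any]:
    """Build a request body; data2 is either omitted or a copy of data1."""
    config: Dict[str, Any] = {
        "prefilterThresholds": {
            "proportionNonZero": 0.05,
            "variance": 0,
            "standardDeviation": 0,
        },
        "data1": data1,
    }
    if include_data2:
        # Self-correlation expressed as data1 == data2.
        config["data2"] = copy.deepcopy(data1)
    config["correlationMethod"] = method
    return {
        "config": config,
        "derivedVariables": [],
        "filters": [],
        "studyId": study_id,
    }


data1 = collection_spec("EUPATH_0000813", "EUPATH_0009252")
one_dataset = build_correlation_request("Bangladesh_healthy_5yr-1", data1)
same_dataset = build_correlation_request(
    "Bangladesh_healthy_5yr-1", data1, include_data2=True
)
print(json.dumps(one_dataset, indent="\t"))
print(json.dumps(same_dataset, indent="\t"))
```

Both bodies should produce a self-correlation result if the change behaves as described.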

asizemore · 2025-10-06T19:00:48Z

@bobular if possible, I'd like to merge this quickly and come back to fix things in a new PR. I'm trying to work on the frontend simultaneously, but running the backend locally is slowing development.
Since we'll be testing with local frontend dev in the next week, I don't think extensive testing is required.

asizemore · 2025-10-27T21:03:51Z

Don't review! I've introduced a bug and I must squash it first. I can see that mbio correlations no longer work.

allow data 2 for correlation to be null

8d7b87e

asizemore marked this pull request as draft September 22, 2025 21:30

asizemore mentioned this pull request Sep 22, 2025

WGCNA: allow genomics correlation to accept self-correlation inputs VEuPathDB/web-monorepo#1382

Open

improve logic for data1=data2

9348309

asizemore marked this pull request as ready for review October 6, 2025 18:33

Merge branch 'master' into improvement-92-allow-correlations-one-dataset

374211a

asizemore requested a review from bobular October 6, 2025 18:58

asizemore marked this pull request as draft October 27, 2025 21:03

asizemore mentioned this pull request Jan 5, 2026

Feature 1382 one network viz to rule them all VEuPathDB/web-monorepo#1497

Draft

asizemore wants to merge 3 commits into master from improvement-92-allow-correlations-one-dataset
