-
Notifications
You must be signed in to change notification settings - Fork 4
Description
Hello,
this is a great idea, been looking to try something like this and looked like the T cell people were more advanced.
Have a couple of questions:
-
I already have an SCE object with integrated vdj data, I have the aa sequences, V gene identity and of course barcode, and I have subsetted so they are all heavy chain only. Is there a way to use this without starting again with scRepertoire?. Looks from getBCR that I could change the name of my colnames in colData for aa, v genes. to match the ones used by scRepertoire.... Would this work? what else would I need to change or make sure to go directly to runIbex with my SCE.
-
which dataset was used for the training? Sorry I might have missed it and not very familiar yet with machine learning.
Finally, a suggestion:
Pulling V genes just with grep from GEX data is not ideal as there are a lot of pseudogenes, I would suggest using chromosomal location or the biotype assigned by cellranger, soomething like:
ig_list <- c("IG_C_gene", "IG_C_pseudogene", "IG_D_gene", "IG_D_pseudogene", "IG_J_gene", "IG_LV_gene", "IG_pseudogene", "IG_V_gene", "IG_V_pseudogene")
And then can query into biomart for genes that have that biotype that are also in your dataset.
Carlos