
Nonstandard values in DMS csv files #8

@alex-hh

Hi, I had a go at running the fitness benchmark and wanted to flag a few non-standard values in some of the datasets:

  • some of the DMS files provide DNA sequences, some RNA sequences - is there a reason for this?
  • all of the sequences in A0A2Z5U3Z0_9INFA_Doud_2016 end in 'EXI' - does this have a meaning?
  • Soo_2021_ribozyme contains rows with NaN DMS_score values
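For anyone who wants to reproduce these checks, here is a minimal sketch using only the Python standard library. It guesses whether a file holds DNA or RNA from the characters present, reports any suffix shared by every sequence, and counts missing scores. The `DMS_score` column name comes from the files; the `mutated_sequence` column name and the helper names `load_rows` and `summarize_dms_rows` are assumptions made for illustration, so adjust them to match the actual CSV headers.

```python
import csv
import math
import os


def load_rows(path):
    """Read a DMS CSV file into a list of dicts (one per row)."""
    with open(path, newline="") as handle:
        return list(csv.DictReader(handle))


def is_missing(value):
    """Treat empty, non-numeric, or NaN score cells as missing."""
    text = "" if value is None else str(value).strip()
    if text == "":
        return True
    try:
        return math.isnan(float(text))
    except ValueError:
        return True


def summarize_dms_rows(rows, seq_col="mutated_sequence", score_col="DMS_score"):
    """Summarize alphabet, shared suffix, and missing scores.

    seq_col is an assumed column name; change it if the files differ.
    """
    seqs = [row[seq_col].upper() for row in rows if row.get(seq_col)]
    alphabet = set("".join(seqs))

    # Rough guess from the characters seen. A sequence set containing
    # neither T nor U is reported as DNA by this heuristic.
    if alphabet <= set("ACGTN"):
        kind = "DNA"
    elif alphabet <= set("ACGUN"):
        kind = "RNA"
    else:
        kind = "protein or mixed"

    # Longest suffix shared by all sequences: take the common prefix
    # of the reversed strings, then reverse it back.
    reversed_seqs = [s[::-1] for s in seqs]
    shared_suffix = os.path.commonprefix(reversed_seqs)[::-1]

    n_missing = sum(1 for row in rows if is_missing(row.get(score_col)))

    return {
        "kind": kind,
        "shared_suffix": shared_suffix,
        "n_missing_scores": n_missing,
        "n_rows": len(rows),
    }
```

Calling `summarize_dms_rows(load_rows(path))` on each DMS file gives a quick per-file report. Note that the shared suffix can be long when every variant only differs from the wild type far from the end of the sequence, so it is a prompt for inspection rather than proof of an anomaly.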

Thanks again for the benchmarks!

Labels: question (Further information is requested)
