You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In order to compare the difference between diamond1 and 2, I built DIAMOND database from the nr database separately. And use diamond1 and 2 for sequence alignment with the same set of nucleic acid sequences, the parameters are the same.
From the results, I found two main differences:
The subject sequence obtained from diamond1 has a version number, such as: XP_028630034.1; while the subject sequence obtained from diamond2 does not have a version number, such as: XP_028630034;
The result obtained by diamond2, the top one subject sequence is always from the UniProtKB/Swiss-Prot, while the result obtained by diamond1 is always the result of the NCBI Ref database.
I read the paper "Sensitive protein alignments at tree-of-life scale using DIAMOND", it is mentioned that the benchmark database uses UniRef50 database information. I'm not sure if the second question I just mentioned is related to this?
Regarding these two questions, I hope to get your answers. thank you very much
wanjin hu
The text was updated successfully, but these errors were encountered:
Did you use a BLAST database when running diamond v2? That would explain the different accessions. Note that for example NP_080749.2 and Q922F4 are the same proteins.
Hi there:
In order to compare the difference between diamond1 and 2, I built DIAMOND database from the nr database separately. And use diamond1 and 2 for sequence alignment with the same set of nucleic acid sequences, the parameters are the same.
From the results, I found two main differences:
The subject sequence obtained from diamond1 has a version number, such as: XP_028630034.1; while the subject sequence obtained from diamond2 does not have a version number, such as: XP_028630034;
The result obtained by diamond2, the top one subject sequence is always from the UniProtKB/Swiss-Prot, while the result obtained by diamond1 is always the result of the NCBI Ref database.
diamond1 result:
diamond2 result:
I read the paper "Sensitive protein alignments at tree-of-life scale using DIAMOND", it is mentioned that the benchmark database uses UniRef50 database information. I'm not sure if the second question I just mentioned is related to this?
Regarding these two questions, I hope to get your answers. thank you very much
wanjin hu
The text was updated successfully, but these errors were encountered: