Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: allele rle normalization + pin pydantic version #234
feat: allele rle normalization + pin pydantic version #234
Changes from 1 commit
c8f5419
1ed754c
8ac1649
2113235
b579734
59424bc
14e5bc9
6736891
9938462
1b7c2ff
daa2195
812f480
af267f0
05deaca
b64aa8b
2e1923a
File filter
Filter by extension
Conversations
Jump to
There are no files selected for viewing
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
An IRI is a reference to another object. It can be of any form under the IETF specification. When we say the
sequence
slot is dereferenced, it means that instead of an IRI, we have aSequenceReference
object. This is true for every property in VRS where we allow for an IRI or object.I think it is fair for us to assume this property (and every property) is dereferenced / has full object representation for normalization. We SHOULD NOT assume that an IRI takes a specific form (e.g. a refseq or ga4gh identifier) as we do here. I also believe that IRIs that contain a colon before an IRI fragment identifier (
#
; again, as seen here) are not valid IRIs.There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We should add a regex pattern on the IRI class. I can make a new issue for this
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Okay, I will update the code + tests to always assume a
SequenceReference
This was just examples for tests. The
SequenceProxy
class will take the input (regardless of refseq/ga4gh/ensembl etc) to get the corresponding sequence.There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@ahwagner thanks for this clarification. The Translator class will need to be updated to work like this (doesn't need to be in this PR). Currently it sets the sequence id (
ga4gh:SQ
, notga4gh:SQR
) as thelocation.sequence
value