http://qs321.pair.com?node_id=1198138


in reply to Re: Find duplicate based on specific fields while allowing 2 mismatch
in thread Find duplicate based on specific fields while allowing 2 mismatch

I think, I need to take the first entry as the reference and allow the two possible mismatch using first line's UMI tag to make one cluster. Remaning lines at the same start positions, can be looped again similarly. So, if the first line have AAAA then AAGG or TTAA, etc can be merged into single cluster, But, TTGG will make separate cluster. I have edited the question for the same! Amit
  • Comment on Re^2: Find duplicate based on specific fields while allowing 2 mismatch

Replies are listed 'Best First'.
Re^3: Find duplicate based on specific fields while allowing 2 mismatch
by kcott (Archbishop) on Aug 28, 2017 at 10:21 UTC
    "I have edited the question for the same!"

    Do not just edit your question without showing very clearly what you've changed: it often means that answers to your original post no longer make sense.

    You can do this in a number of ways:

    • For a small change within a sentence: <del>old text</del><ins>new text</ins>.
    • To remove a large amount of text, code, or data: add an Update comment explaining what you're doing, then <strike>... part to remove ...</strike>.
    • To add new content: use an Update comment explaining what you're doing, then add the new content.
    • To change a large amount of text, code, or data: add an Update comment explaining what you're doing, then <strike>...</strike> (as per the lastearlier point); then add replacement text, code, or data.
    • If you're striking out significant sections of your post, and the result ends up difficult to read, consider putting those parts inside <spoiler>...</spoiler> or <readmore>...</readmore> tags; however, ensure the Update comment is visible.

    Never just delete some, or all, of your post!

    See "How do I change/delete my post?" for details and further discussion.

    — Ken