I believe
moritz's script will do what you want, but your expected output is confusing:
MA01001A1A03.f1 760 5640111 ad1
MA01001A1A04.f1 300 15232924 ubq
MA01001A1B22.f1 580 77745475 ra
Should not the second line list the first occurrence of
ubq? And what happened to
DNA?
MA01001A1A03.f1 760 5640111 ad1
MA01001A1A03.f1 760 42572233 ubq
MA01001A1A04.f1 300 145334669 DNA
MA01001A1B22.f1 580 77745475 ra
Update: Ah, yes...
As moritz points out (in more polite words) below, I am an idiot. Or tired. I repent, expect swift and adequate punishment, and demand this node be voted down to about -5. (but not much more. please)