While my data falls into the bio realm, I don't think the actual operation uses anything that BioPerl provides. (We do have BioPerl on the server as it is used in other scripts.) I have sent a registration request to the BioPerl mailing list though.
If it helps, think of my data as some sort of date string where I need to search for any other occurrences of the central portion. 072017, where I need to find any other "transaction" that happened with the matching month of 07 and year of 17. Basically I need to split each string into three parts, and find matches in the database that are identical only in the first and last part. (But I have 250k+ iterations to search from, through a list of over 1 million strings each time.)
|