Specifically, we're looking at words of six letters, each being A, C, G or T. A and C are m while G and T are k. We're looking for groups of two, three and four where any of the six spaces are not exclusively m or k. From a corpus of 19 barcodes of a possible 48, using the sequencers available, we have developed a set of 247,528 triplets of triplets where:
- the barcodes within each triplet do not conflict on the m and k thing
- a barcode used within triplet 1 is not used in triplet 2 or triplet 3, and so on
Now, we want to cut that list down to where, if, for example, samples A and B from triplet 1 and sample C from triplet 3 fail, can those samples with those barcodes be used together? My coworker believes that it should be possible to find groups where all 9 can be used interchangably, but his code so far fails to select them.
I look at that and think that subdividing them into groups that work together and giving up the ones with the longest list is probably the way to go.
I look at that and think that subdividing them into groups that work together and giving up the ones with the longest list is probably the way to go.
Which is what I'm now about to implement.
No comments:
Post a Comment