Are there any efforts to train models on short DNA sequences? For example, sequences no longer than 50 DNA nucleotides?