|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Object compbio.data.sequence.SequenceUtil
public final class SequenceUtil
Utility class for operations on sequences
Field Summary | |
---|---|
static Pattern |
AA
Valid Amino acids |
static Pattern |
AMBIGUOUS_AA
Same as AA pattern but with two additional letters - XU |
static Pattern |
AMBIGUOUS_NUCLEOTIDE
Ambiguous nucleotide |
static Pattern |
DIGIT
A digit |
static Pattern |
NON_AA
inversion of AA pattern |
static Pattern |
NON_NUCLEOTIDE
Non nucleotide |
static Pattern |
NONWORD
Non word |
static Pattern |
NUCLEOTIDE
Nucleotides a, t, g, c, u |
static Pattern |
WHITE_SPACE
A whitespace character: [\t\n\x0B\f\r] |
Method Summary | |
---|---|
static String |
cleanSequence(String sequence)
Removes all whitespace chars in the sequence string |
static void |
closeSilently(Logger log,
Closeable stream)
Closes the Closable and logs the exception if any |
static String |
deepCleanSequence(String sequence)
Removes all special characters and digits as well as whitespace chars from the sequence |
static boolean |
isAmbiguosProtein(String sequence)
Check whether the sequence confirms to amboguous protein sequence |
static boolean |
isNonAmbNucleotideSequence(String sequence)
Ambiguous DNA chars : AGTCRYMKSWHBVDN // differs from protein in only one (!) - B char |
static boolean |
isNucleotideSequence(FastaSequence s)
|
static boolean |
isProteinSequence(String sequence)
|
static HashSet<Score> |
readAAConResults(InputStream results)
Read AACon result with no alignment files. |
static List<FastaSequence> |
readFasta(InputStream inStream)
Reads fasta sequences from inStream into the list of FastaSequence objects |
static List<AnnotatedSequence> |
readJRonn(File result)
|
static List<AnnotatedSequence> |
readJRonn(InputStream inStream)
Reader for JRonn horizontal file format |
static void |
writeFasta(OutputStream os,
List<FastaSequence> sequences)
Writes FastaSequence in the file, each sequence will take one line only |
static void |
writeFasta(OutputStream outstream,
List<FastaSequence> sequences,
int width)
Writes list of FastaSequeces into the outstream formatting the sequence so that it contains width chars on each line |
static void |
writeFastaKeepTheStream(OutputStream outstream,
List<FastaSequence> sequences,
int width)
|
Methods inherited from class java.lang.Object |
---|
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
---|
public static final Pattern WHITE_SPACE
public static final Pattern DIGIT
public static final Pattern NONWORD
public static final Pattern AA
public static final Pattern NON_AA
public static final Pattern AMBIGUOUS_AA
public static final Pattern NUCLEOTIDE
public static final Pattern AMBIGUOUS_NUCLEOTIDE
public static final Pattern NON_NUCLEOTIDE
Method Detail |
---|
public static boolean isNucleotideSequence(FastaSequence s)
public static boolean isNonAmbNucleotideSequence(String sequence)
public static String cleanSequence(String sequence)
sequence
-
public static String deepCleanSequence(String sequence)
sequence
-
public static boolean isProteinSequence(String sequence)
sequence
-
public static boolean isAmbiguosProtein(String sequence)
sequence
-
public static void writeFasta(OutputStream outstream, List<FastaSequence> sequences, int width) throws IOException
outstream
- sequences
- width
- - the maximum number of characters to write in one line
IOException
public static void writeFastaKeepTheStream(OutputStream outstream, List<FastaSequence> sequences, int width) throws IOException
IOException
public static List<FastaSequence> readFasta(InputStream inStream) throws IOException
inStream
- from
IOException
public static void writeFasta(OutputStream os, List<FastaSequence> sequences) throws IOException
os
- sequences
-
IOException
public static List<AnnotatedSequence> readJRonn(File result) throws IOException, UnknownFileFormatException
IOException
UnknownFileFormatException
public static List<AnnotatedSequence> readJRonn(InputStream inStream) throws IOException, UnknownFileFormatException
>Foobar M G D T T A G 0.48 0.42 0.42 0.48 0.52 0.53 0.54Where all values are tab delimited
inStream
- the InputStream connected to the JRonn output file
AnnotatedSequence
objects
IOException
- is thrown if the inStream has problems accessing the data
UnknownFileFormatException
- is thrown if the inStream represents an unknown source of
data, i.e. not a JRonn outputpublic static final void closeSilently(Logger log, Closeable stream)
log
- stream
- public static HashSet<Score> readAAConResults(InputStream results)
results
- output file of AAConservation
Method
-> float[]
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |