AACon Web Service
This page provides some tips on using AACon web service. Please note that for now all the examples are in Java. Other languages will follow given a sufficient demand. Please also note that AACon web service client requires Java 6.
Outline- AACon web service Details
- Using web service command line client
- Structure of the template command line client
- AACon web service functions overview
- Building web services artifacts
- Connecting to AAConWS
- Calculating conservation
- Checking the status of the calculation
- Calculating conservation with presets
- Calculating conservation with custom parameters
- A complete AAConWS web service client example
AACon Web Service Details
Feature | Description |
---|---|
Operated by | The University of Dundee, Computational Biology Group headed by Prof Geoff Barton. It is backed up by the College of Life Sciences HPC cluster. |
Service web site | http://www.compbio.dundee.ac.uk/aacon |
Execution limits | Tasks exceeding 5000×1000 (sequences per letters) will not be accepted for alignment. If you would like to work with bigger alignments consider using a command line version of AACon. |
AACon web service WSDL | http://www.compbio.dundee.ac.uk/aacon/AAConWS?wsdl |
Using Web Service Command Line Client
The command client can be used to calculate conservation using public AACon web service. The client is operating system independent and supports most of the AACon web service functions. Using this client you could calculate conservation using web service features such as presets and parameters. Below is the list of options supported by the command line client.
Usage: java -jar <path_to_jar_file> ACTION [OPTIONS]ACTIONS:
-i=<inputFile> - full path to Fasta or Clustal formatted alignment file
-parameters - lists parameters supported by web service
-presets - lists presets supported by web service
-limits - lists web services limits
Please note that if input file is specified other actions are ignored
OPTIONS: (only for use with -i action):
-r=<presetName> - name of the preset to use
-o=<outputFile> - full path to the file where to write the results
-f=<parameterInputFile> - the name of the file with the list of parameters to use.
Please note that -r and -f options cannot be used together. Conservation can be calculated with either a preset or the parameters from the file, but not both!
Using the command line web service client is easy. For example to calculate conservation for the alignment from input.fasta file using AACon web service with default settings and print the results to the console. By default, the conservation is calculated by SHENKIN method.
java -jar aacon-client.jar -i=d:\input.fasta
Content of input.fasta
file is show below (please note sequences has been trimmed for clarity)
>QUERY
MEWLKIALVAFSGRGNDVDYPCSILGRVANNDKELVNIL
>Q4VBK1
---------------PDTDEPVWILGACYNVKTKKSELL
>A7RTH8
---MDAACVTYEGVSHETEEDVWILGKRYNIDMGYLNTD
java -jar aacon-client.jar -i=d:\input.fasta -o=d:\result.txt -r="Quick conservation"
Get the list of presets supported by AACon web service
java -jar aacon-client.jar -presets
Structure of the template command line client
Packages | Classes and Interfaces |
compbio.data.msa | Annotation interface for all scoring methods |
compbio.data.sequence | AAConWS data types |
compbio.metadata | AAConWS meta data types |
compbio.ws.client | AAConWS command line client package with AAConClient.java class containing the main method |
Please refer to a data model javadoc for a detailed description of each class and its methods.
AACon web service functions overview
Functions for conservation calculation String id = analize(List<FastaSequence> list)
String id = customAnalize(List<FastaSequence> sequenceList, List<Option> optionList)
String id = presetAnalize(List<FastaSequence> sequenceList, Preset preset)
Functions pertaining to job monitoring and control
JobStatus status = getJobStatus(String id)
HashSet<Score> conservation = getConservation(String id)
boolean cancelled = cancelJob(String id)
ChunkHolder chunk = pullExecStatistics(String id, long marker)
Functions relating to service features discovery
RunnerConfig rc = getRunnerOptions()
Limit limit = getLimit(String name)
LimitsManager lm = getLimits()
PresetManager pm = getPresets()
Please refer to a data model javadoc for a detailed description of each methods.
Building web services artifacts
AAConWS are the standard JAX-WS SOAP web services, which are WS-I basic profile compatible. This means that you could use your favorite programming language to work with AAConWS. Below is how you can generate portable artifacts to work with AAConWS from Java. However, if programming in Java we recommend using our client library as it provides a handful of useful methods in addition to plain data types.
wsimport -keep http://www.compbio.dundee.ac.uk/aacon/AAConWS?wsdl
Connecting to AAConWS
All the examples below assume that AACon web service command line client is in the classpath. You can download it from the download page. The code excerpt below will connect your program to AAConWS web service deployed in the University of Dundee.
Annotation<AAConWS> client = AAConWSClient.connect();
If you want to work with the generated artifacts directly you can inspect AAConWS command line client source code and use it as a template for building your own custom AAConWS clients.
Calculating conservation
Given that client is web service proxy, created as described in "Connecting to AAConWS" section, the conservation scores can be obtained as follows:
1) List<FastaSequence> fsl = SequenceUtil.readFasta(new FileInputStream("alignment.fasta"));
2) String jobId = client.analize(fsl);
3) HashSet<Score> result = client.getAnnotation(jobId);
Line one loads sequence alignment from the file
Line two submits them to web service represented by AAConWS proxy
Line three retrieves the conservation scores calculated according to Shenkin algorithm. This line blocks the execution until the result is available. Use this with caution. In general, you should make sure that the calculation has been completed before attempting retrieving results. This is to avoid keeping the connection to the server on hold for a prolonged periods of time. While this may be ok with your local server, our public server (www.compbio.dundee.ac.uk/aacon) will not let you hold the connection for longer than 10 minutes. This is done to prevent excessive load on the server. The next section describes how to check the status of the calculation.
Methods and classes mentioned in the excerpt are available from the AAConWS client library.
Checking the status of the calculation
You may have noticed that there was no pause between submitting the job and retrieving the results. This is because getAnnotation(jobId) method block the processing until the calculation is completed. However, taking into account that the connection holds server resources, our public server (www.compbio.dundee.ac.uk/aacon) is configured to reset the connection after 10 minutes of waiting. To work around the connection reset you are encouraged to check whether the calculation has been completed before accessing the results. You can do it like this:
while (client.getJobStatus(jobId) != JobStatus.FINISHED) {
Thread.sleep(2000); // wait two seconds, then recheck the status
}
Calculating conservation with presets
1) PresetManager<AACon> presets = client.getPresets();
2) String jobId = client.presetAnalize(fsl, presets.getPresetByName("Quick conservation"));
3) HashSet<Score> result = client.getAnnotation(jobId);
Line one obtains the lists of presets supported by a web service.
Line two calls web service presetAnalise method with a chosen preset. This call returns a job identifier.
Lines three retrieves the results using job identifier.
Available presets are
- "Quick conservation" (a collection of 12 fast conservation algorithms includes KABAT, JORES, SCHNEIDER, SHENKIN, GERSTEIN, TAYLOR_GAPS, TAYLOR_NO_GAPS, ZVELIBIL,ARMON, THOMPSON, NOT_LANCET, MIRNY, WILLIAMSON)
- "Slow conservation" (a collection of time consuming conservation algorithms includes LANDGRAF, KARLIN, SANDER, VALDAR and SMERFS)
- "Complete conservation" (all available algorithms)
Calculating conservation with custom parameters
Below is the example of using custom parameters for SMERFS method.
// Using options
1) RunnerConfig<AACon> options = client.getRunnerOptions();
2) options.getArgument("Calculation method").setDefaultValue("SMERFS");
3) options.getArgument("SMERFS Column Scoring Method").setDefaultValue("MAX_SCORE");
4) options.getArgument("SMERFS Gap Threshhold").setDefaultValue("1");
5) String jobId = client.customAnalize(fsl, options.getArguments());
6) HashSet<Score> result = client.getAnnotation(jobId);
Line one obtains the RunnerConfig object that holds information on supported parameters and their values
Line two retrieve a particular parameter from the holder by its name and sets the new value for this parameter.
Lines three and four do the same but for two more parameters
Line five submit a job to a web service
Line six retrieves the results of the analysis. The names of all the parameters supported by a web service can be obtained using options.getArguments() method. Further details on the methods available from RunnerConfig object are available from the javadoc.
A complete AAConWS web service client example
Finally, a complete example of the program that connects to AAConWS service is below. Three is also a PDF version of this example with syntax highlighted. The text notes are commented by block style comments e.g. /* comment */, the code alternatives are commented out with the line comments - "//". You may want to remove line style comments to test alternatives of the functions. All you need for this to work is a AAConWS binary client. Please make sure that the client is in the Java class path before running this example.
import java.io.ByteArrayInputStream; import java.io.FileNotFoundException; import java.io.IOException; import java.util.List; import java.util.Set; import compbio.data.msa.Annotation; import compbio.data.sequence.FastaSequence; import compbio.data.sequence.Score; import compbio.data.sequence.SequenceUtil; import compbio.metadata.JobSubmissionException; import compbio.metadata.Preset; import compbio.metadata.PresetManager; import compbio.metadata.ResultNotAvailableException; import compbio.metadata.UnsupportedRuntimeException; import compbio.metadata.WrongParameterException; import compbio.runner.conservation.AACon; /** * AAConWS client example */ public class AAConWSClientExample { /* * Input sequences for alignment. For the simplicity keep them in the class */ static final String input = ">Foo \r\n" + "MTADGPRELLQLRAAVRHRPQDFVAWLMLADAELGMGDTTAGEMAVQRGLALHPGHPEAV\r\n" + "ARLGRVRWTQQRHAEAAVLLQQASDAAPEHPGIALWLGHALEDAGQAEAAAAAYTRAHQL\r\n" + "LPEEPYITAQLLNWRRRLCDWRALDVLSAQVRAAVAQGVGAVEPFAFLSEDASAAEQLAC\r\n" + "ARTRAQAIAASVRPLAPTRVRSKGPLRVGFVSNGFGAHPTGLLTVALFEALQRRQPDLQM\r\n" + "HLFATSGDDGSTLRTRLAQASTLHDVTALGHLATAKHIRHHGIDLLFDLRGWGGGGRPEV\r\n" + "FALRPAPVQVNWLAYPGTSGAPWMDYVLGDAFALPPALEPFYSEHVLRLQGAFQPSDTSR\r\n" + "VVAEPPSRTQCGLPEQGVVLCCFNNSYKLNPQSMARMLAVLREVPDSVLWLLSGPGEADA\r\n" + "RLRAFAHAQGVDAQRLVFMPKLPHPQYLARYRHADLFLDTHPYNAHTTASDALWTGCPVL\r\n" + "TTPGETFAARVAGSLNHHLGLDEMNVADDAAFVAKAVALASDPAALTALHARVDVLRRES\r\n" + "GVFEMDGFADDFGALLQALARRHGWLGI\r\n" + "\r\n" + ">Bar \r\n" + "-----------------------------------MGDTTAGEMAVQRGLALH-------\r\n" + "---------QQRHAEAAVLLQQASDAAPEHPGIALWL-HALEDAGQAEAAAA-YTRAHQL\r\n" + "LPEEPYITAQLLN--------------------AVAQGVGAVEPFAFLSEDASAAE----\r\n" + "----------SVRPLAPTRVRSKGPLRVGFVSNGFGAHPTGLLTVALFEALQRRQPDLQM\r\n" + "HLFATSGDDGSTLRTRLAQASTLHDVTALGHLATAKHIRHHGIDLLFDLRGWGGGGRPEV\r\n" + "FALRPAPVQVNWLAYPGTSGAPWMDYVLGDAFALPPALEPFYSEHVLRLQGAFQPSDTSR\r\n" + "VVAEPPSRTQCGLPEQGVVLCCFNNSYKLNPQSMARMLAVLREVPDSVLWLLSGPGEADA\r\n" + "RLRAFAHAQGVDAQRLVFMPKLPHPQYLARYRHADLFLDTHPYNAHTTASDALWTGCPVL\r\n" + "TTPGETFAARVAGSLNHHLGLDEMNVADDAAFVAKAVALASDPAALTALHARVDVLRRES\r\n" + "GVFEMDGFADDFGALLQALARRHGWLGI\r\n" + "\r\n" + ">Noname \r\n" + "-MTADGPRELLQLRAAVRHRPQDVAWLMLADAELGMGDTTAGEMAVQRGLALHPGHPEAV\r\n" + "ARLGRVRWTQQRHAEAAVLLQQASDAAPEHPGIALWLGHALED--------------HQL\r\n" + "LPEEPYITAQLDVLSAQVR-------------AAVAQGVGAVEPFAFLSEDASAAEQLAC\r\n" + "ARTRAQAIAASVRPLAPTRVRSKGPLRVGFVSNGFGAHPTGLLTVALFEALQRRQPDLQM\r\n" + "HLFATSGDDGSTLRTRLAQASTLHDVTALGHLATAKHIRHHGIDLLFDLRGWGGGGRPEV\r\n" + "FALRPAPVQVNWLAYPGTSGAPWMDYVLGDAFALPPALEPFYSEHVLRLQGAFQPSDTSR\r\n" + "VVAEPPSRTQCGLPEQGVVLCCFNNSYKLNPQSMARMLAVLREVPDSVLWLLSGPGEADA\r\n" + "RLRAFAHAQGVDAQRLVFMPKLPHPQYLARYRHADLFLDTHPYNAHTTASDALWTGCPVL\r\n" + "TTPGETFAARVAGSLNHHLGLDEMNVADDAAFVAKAVALASDPAALTALHARVDVLRRES\r\n" + "I---------------------------"; public static void main(String[] args) throws UnsupportedRuntimeException, JobSubmissionException, WrongParameterException, FileNotFoundException, IOException, ResultNotAvailableException, InterruptedException { /* * Annotation interface for AAConWS web service instance */ Annotation<AACon> client = AAConClient.connect(); /* Get the list of available presets */ PresetManager presetman = client.getPresets(); /* Get the Preset object by preset name */ Preset preset = presetman.getPresetByName("Complete conservation"); /* * Load sequences in FASTA format from the file You can use something * like new FileInputStream() to load sequence from the file */ List<FastaSequence> fastalist = SequenceUtil .readFasta(new ByteArrayInputStream(input.getBytes())); /* * Submit loaded sequences for an alignment using preset. The job * identifier is returned by this method, you can retrieve the results * with it sometime later. */ String jobId = client.presetAnalize(fastalist, preset); /* This method will block for the duration of the calculation */ Set<Score> result = client.getAnnotation(jobId); /* * This is a better way of obtaining results, it does not involve * holding the connection open for the duration of the calculation, * Besides, as the University of Dundee public server will reset the * connection after 10 minutes of idling, this is the only way to obtain * the results of long running task from our public server. */ // while (client.getJobStatus(jobId) != JobStatus.FINISHED) { // Thread.sleep(1000); // wait a second, then recheck the status // } /* Output the alignment to standard out */ Score.write(result, System.out); /* Alternatively, you can record retrieved alignment into the file */ // FileOutputStream out = new FileOutputStream("result.txt"); // Score.write(result, out); // out.close(); } }For a more detailed description of all available types and their functions please refer to the data model javadoc.