Technical information



Dependencies: Java SE Development Kit, Apache Ant

Links to articles


Web-directory structure

MS-Align+ assumes that all files are available via web and requires special directory structure. All static files that exposed to the web should be copied from

Each user dataset with spectrums data, protein database, parameters and resulting files is placed in separate directory.

form.html                     - web-pages that creates datasets directories with the required data and parameters via JettyServer at /action/upload
<dataset directory #1>        - directories with datasets
...<dataset directory #N>
Directory structure for the dataset
    input_data                - spectrum data          - configuration file
    <ProteinDatabase>.fasta   - protein database
    result.txt                - MS-Align+ results for further processing
    result_table.txt          - tab-separated results file
    A lot of *.xml generated from MS-Align+ results.
    proteins.html             - index of all discovered proteins, the entry point for viewing MS-Align+ results
    sorttable.js              - JavaScript code that supports sorting on PrSMs pages (    
    A lot of htmls generated from /xml folder using XSLT
Sample file
# Error tolerance in PPM
# E-value threshold
# Maximum number of modifications
#Cysteine protection group can be Carbamidoemetylation and Carboxymethylation
#instrument can be ETD
Full process of the input data

This command runs MS-ALign+, stores results in msoutput directory, generates xml files in xml directory  and generates html files in html directory.

java -classpath jar/*:lib/jar/* edu.ucsd.msalign.spec.web.Pipeline <PathToTheDatasetDir>

We recommend to run it on the computer with 12GB of free RAM using command line option like in example below:

java -server -d64 -Xmx12048M -classpath jar/*:lib/jar/* edu.ucsd.msalign.spec.web.Pipeline <PathToTheDatasetDir>

Generating XML files from MS-ALign+ results

This command assumes that MS-ALign+ results are ready in msouput directory and generates xml files in xml directory.

java -classpath jar/*:lib/jar/* edu.ucsd.msalign.spec.view.ResultProcessor <PathToTheDatasetDir> <ProteinDatabase>.fasta <Maximum E-value>

Maximum E-value is an optional parameter, it removes all PrSMs with greater E-value. If it is not specified all PrSM generated by MS-Align are used.

MS-Align uses threshold value in as maximum E-value.

Generating HTML files form XML

This command assumes that xml files are ready in xml directory and generates html files in html directory using xslt.

ant -lib lib/jar/saxon9he.jar -Ddataset=<PathToTheDatasetDir> xslt

Build release

Create empty directory and go to it. After that execute

svn checkout .

ant release

After that you will get file msalign-build<Revision-number>.zip.

How to update project from svn

svn up

ant jarmsalign

Jetty server

For the automatic creation of the dataset directories with correct parameters we have special web server based on Jetty, that can be started using following command:

java -classpath jar/*:lib/jar/* edu.ucsd.msalign.spec.web.JettyServer <PathToTheRootDirectoryOfAllDatasets> <Port>

Port parameter is optional and default value is 8080.

forms.html assumes that Jetty listens at /action/upload and the front-end web-server should be configured to pass all /action/* requests to Jetty.

Sample configuration code from nginx.conf looks like

 location /action {


Xiaowen Liu, Yakov Sirotkin, Pavel Pevzner

Enter labels to add to this page:
Please wait 
Looking for a label? Just start typing.