|
|
mwfilter |
mwfilter removes unwanted (noisy) data from mass spectrometry output in proteomics. It reads an input file with a list of experimental molecular weights and writes the same data to an output file but with unwanted (noisy) data removed, at a specified ppm tolerance. Optionally the deleted molecular weights are shown. It removes those molecular weights which are:
The last two operations can be done as most peaks are reported in both modified and unmodified forms. Removal of modified peaks aids in database searching for protein identification.
% mwfilter Filter noisy data from molecular weights file Molecular weights file: molwts.dat Ppm tolerance [50.0]: Molecular weights output file [molwts.mwfilter]: |
Go to the input files for this example
Go to the output files for this example
Standard (Mandatory) qualifiers:
[-infile] infile Molecular weights file
-tolerance float [50.0] Ppm tolerance (Any numeric value)
[-outfile] outfile [*.mwfilter] Molecular weights output file
Additional (Optional) qualifiers:
-showdel boolean [N] Output deleted mwts
Advanced (Unprompted) qualifiers:
-datafile datafile [Emwfilter.dat] Molecular weight standards
data file
Associated qualifiers:
"-outfile" associated qualifiers
-odirectory2 string Output directory
General qualifiers:
-auto boolean Turn off prompts
-stdout boolean Write first file to standard output
-filter boolean Read first file from standard input, write
first file to standard output
-options boolean Prompt for standard and additional values
-debug boolean Write debug output to program.dbg
-verbose boolean Report some/full command line options
-help boolean Report command line options. More
information on associated and general
qualifiers can be found with -help -verbose
-warning boolean Report warnings
-error boolean Report errors
-fatal boolean Report fatal errors
-die boolean Report dying program messages
|
| Standard (Mandatory) qualifiers | Allowed values | Default | |
|---|---|---|---|
| [-infile] (Parameter 1) |
Molecular weights file | Input file | Required |
| -tolerance | Ppm tolerance | Any numeric value | 50.0 |
| [-outfile] (Parameter 2) |
Molecular weights output file | Output file | <*>.mwfilter |
| Additional (Optional) qualifiers | Allowed values | Default | |
| -showdel | Output deleted mwts | Boolean value Yes/No | No |
| Advanced (Unprompted) qualifiers | Allowed values | Default | |
| -datafile | Molecular weight standards data file | Data file | Emwfilter.dat |
874.364756 927.450380 1045.572 1068.397129 1121.431124 1163.584593 1305.660840 1428.596448 1479.748341 1502.549157 1554.591658 1567.686209 1576.708354 1639.868056 1748.611920 1753.745298 1880.841178 2383.99 |
The input file is a simple list of the experimental molecular weights.
There should be one weight per line.
Comments in the data file start with a '#' character in the first column.
Blank lines are ignored.
874.364756 927.450380 1068.397129 1121.431124 1163.584593 1305.660840 1428.596448 1479.748341 1502.549157 1554.591658 1567.686209 1576.708354 1639.868056 1748.611920 1753.745298 1880.841178 |
EMBOSS data files are distributed with the application and stored in the standard EMBOSS data directory, which is defined by the EMBOSS environment variable EMBOSS_DATA.
To see the available EMBOSS data files, run:
% embossdata -showall
To fetch one of the data files (for example 'Exxx.dat') into your current directory for you to inspect or modify, run:
% embossdata -fetch -file Exxx.dat
Users can provide their own data files in their own directories. Project specific files can be put in the current directory, or for tidier directory listings in a subdirectory called ".embossdata". Files for all EMBOSS runs can be put in the user's home directory, or again in a subdirectory called ".embossdata".
The directories are searched in the following order:
| Program name | Description |
|---|---|
| backtranambig | Back-translate a protein sequence to ambiguous nucleotide sequence |
| backtranseq | Back-translate a protein sequence to a nucleotide sequence |
| charge | Draw a protein charge plot |
| checktrans | Reports STOP codons and ORF statistics of a protein |
| compseq | Calculate the composition of unique words in sequences |
| emowse | Search protein sequences by digest fragment molecular weight |
| freak | Generate residue/base frequency table or plot |
| iep | Calculate the isoelectric point of proteins |
| mwcontam | Find weights common to multiple molecular weights files |
| octanol | Draw a White-Wimley protein hydropathy plot |
| pepinfo | Plot amino acid properties of a protein sequence in parallel |
| pepstats | Calculates statistics of protein properties |
| pepwindow | Draw a Kyte-Doolittle hydropathy plot for a protein sequence |
| pepwindowall | Draw Kyte-Doolittle hydropathy plot for a protein alignment |
| wordcount | Count and extract unique words in DNA sequence(s) |