mspalign
Align mass spectra from multiple peak lists from LC/MS or GC/MS data set
Syntax
[
CMZ
, AlignedPeaks
]
= mspalign(Peaklist
)
[CMZ
, AlignedPeaks
]
= mspalign(Peaklist
, ...'Quantile', QuantileValue
,
...)
[CMZ
, AlignedPeaks
]
= mspalign(Peaklist
, ...'EstimationMethod', EstimationMethodValue
,
...)
[CMZ
, AlignedPeaks
]
= mspalign(Peaklist
, ...'CorrectionMethod', CorrectionMethodValue
,
...)
[CMZ
, AlignedPeaks
]
= mspalign(Peaklist
, ...'ShowEstimation', ShowEstimationValue
,
...)
Input Arguments
Peaklist | Cell array of peak lists from a liquid chromatography/mass
spectrometry (LC/MS) or gas chromatography/mass spectrometry (GC/MS)
data set. Each element in the cell array is a two-column matrix with
m/z values in the first column and ion intensity values in the second
column. Each element corresponds to a spectrum or retention time. Note You can use the |
QuantileValue | Value that determines which peaks are selected by the estimation
method to create CMZ , the vector of common
m/z values. Choices are any value ≥ 0 and ≤
1 . Default is 0.95 . |
EstimationMethodValue | Character vector or string specifying the method to estimate CMZ ,
the vector of common mass/charge (m/z) values. Choices are:
|
CorrectionMethodValue | Character vector or string specifying the method to align each peak list to the
CMZ vector. Choices are:
|
ShowEstimationValue | Controls the display of an assessment plot relative to the
estimation method and the vector of common mass/charge (m/z) values.
Choices are true or false . Default
is either:
|
Output Arguments
CMZ | Vector of common mass/charge (m/z) values estimated by the mspalign function. |
AlignedPeaks | Cell array of peak lists, with the same form as Peaklist ,
but with corrected m/z values in the first column of each matrix. |
Description
[
aligns
mass spectra from multiple peak lists (centroided data), by first
estimating CMZ
, AlignedPeaks
]
= mspalign(Peaklist
)CMZ
, a vector of common mass/charge
(m/z) values estimated by considering the peaks in all spectra in Peaklist
,
a cell array of peak lists, where each element corresponds to a spectrum
or retention time. It then aligns the peaks in each spectrum to the
values in CMZ
, creating AlignedPeaks
,
a cell array of aligned peak lists.
[
calls CMZ
, AlignedPeaks
]
= mspalign(Peaklist
, ...'PropertyName
', PropertyValue
,
...)mspalign
with optional properties
that use property name/property value pairs. You can specify one or
more properties in any order. Each PropertyName
must
be enclosed in single quotation marks and is case insensitive. These
property name/property value pairs are as follows:
[
determines which peaks are selected by the estimation
method to create CMZ
, AlignedPeaks
]
= mspalign(Peaklist
, ...'Quantile', QuantileValue
,
...)CMZ
, the vector of common
m/z values. Choices are a scalar between 0
and 1
.
Default is 0.95
.
[
specifies the method used to estimate CMZ
, AlignedPeaks
]
= mspalign(Peaklist
, ...'EstimationMethod', EstimationMethodValue
,
...)CMZ
,
the vector of common mass/charge (m/z) values. Choices are:
histogram
— Default method. Peak locations are clustered using a kernel density estimation approach. The peak ion intensity is used as a weighting factor. The center of all the clusters conform to theCMZ
vector.regression
— Takes a sample of the distances between observed significant peaks and regresses the inter-peak distance to create theCMZ
vector with similar inter-element distances.
[
specifies the method used to align each peak list
to the CMZ
, AlignedPeaks
]
= mspalign(Peaklist
, ...'CorrectionMethod', CorrectionMethodValue
,
...)CMZ
vector. Choices are:
nearest-neighbor
— Default method. For each common peak in theCMZ
vector, its counterpart in each peak list is the peak that is closest to the common peak's m/z value.shortest-path
— For each common peak in theCMZ
vector, its counterpart in each peak list is selected using the shortest path algorithm.
[
controls the display of an assessment plot relative
to the estimation method and the estimated vector of common mass/charge
(m/z) values. Choices are CMZ
, AlignedPeaks
]
= mspalign(Peaklist
, ...'ShowEstimation', ShowEstimationValue
,
...)true
or false
.
Default is either:
false
— When return values are specified.true
— When return values are not specified.
Examples
Load a MAT-file, included with the Bioinformatics Toolbox™ software, which contains liquid chromatography/mass spectrometry (LC/MS) data variables, including
peaks
andret_time
.peaks
is a cell array of peak lists, where each element is a two-column matrix of m/z values and ion intensity values, and each element corresponds to a spectrum or retention time.ret_time
is a column vector of retention times associated with the LC/MS data set.load lcmsdata
Resample the unaligned data, display it in a heat map, and then overlay a dot plot.
[MZ,Y] = msppresample(ms_peaks,5000); msheatmap(MZ,ret_time,log(Y))
msdotplot(ms_peaks,ret_time)
Click the Zoom In button, and then click the dot plot two or three times to zoom in and see how the dots representing peaks overlay the heat map image.
Align the peak lists from the mass spectra using the default estimation and correction methods.
[CMZ, aligned_peaks] = mspalign(ms_peaks);
Resample the unaligned data, display it in a heat map, and then overlay a dot plot.
[MZ2,Y2] = msppresample(aligned_peaks,5000); msheatmap(MZ2,ret_time,log(Y2))
msdotplot(aligned_peaks,ret_time)
Link the axes of the two heat plots and zoom in to observe the detail to compare the unaligned and aligned LC/MS data sets.
linkaxes(findobj(0,'Tag','MSHeatMap')) axis([480 532 375 485])
References
[1] Jeffries, N. (2005) Algorithms for alignment of mass spectrometry proteomic data. Bioinfomatics 21:14, 3066–3073.
[2] Purvine, S., Kolker, N., and Kolker, E. (2004) Spectral Quality Assessment for High-Throughput Tandem Mass Spectrometry Proteomics. OMICS: A Journal of Integrative Biology 8:3, 255–265.
Version History
Introduced in R2007a
See Also
msbackadj
| msdotplot
| msalign
| msheatmap
| mslowess
| msnorm
| mspeaks
| msresample
| msppresample
| mssgolay
| msviewer
Topics
- Mass Spectrometry and Bioanalytics
- Preprocessing Raw Mass Spectrometry Data
- Visualizing and Preprocessing Hyphenated Mass Spectrometry Data Sets for Metabolite and Protein/Peptide Profiling
- Differential Analysis of Complex Protein and Metabolite Mixtures Using Liquid Chromatography/Mass Spectrometry (LC/MS)