Skip to main content

Generalized Statistical Label Fusion using Multiple Consensus Levels

Posted by on Wednesday, February 1, 2012 in Image Segmentation, Magnetic resonance imaging, Neuroimaging, Registration, Reproducibility.

Z. Xu, A. Asman and B. Landman. “Generalized Statistical Label Fusion using Multiple Consensus Levels.” In Proceedings of the SPIE Medical Imaging Conference. San Diego, California, February 2012 (Oral Presentation) PMC3438516†

Full Text: https://www.ncbi.nlm.nih.gov/pubmed/?term=%E2%80%9CGeneralized+Statistical+Label+Fusion+using+Multiple+Consensus+Levels.

Abstract

Segmentation plays a critical role in exposing connections between biological structure and function. The process of label fusion collects and combines multiple observations into a single estimate. Statistically driven techniques provide mechanisms to optimally combine segmentations; yet, optimality hinges upon accurate modeling of rater behavior. Traditional approaches, e.g., Majority Vote and Simultaneous Truth and Performance Level Estimation (STAPLE), have been shown to yield excellent performance in some cases, but do not account for spatial dependences of rater performance (i.e., regional task difficulty). Recently, the COnsensus Level, Labeler Accuracy and Truth Estimation (COLLATE) label fusion technique augmented the seminal STAPLE approach to simultaneously estimate regions of relative consensus versus confusion along with rater performance. Herein, we extend the COLLATE framework to account for multiple consensus levels. Toward this end, we posit a generalized model of rater behavior of which Majority Vote, STAPLE, STAPLE Ignoring Consensus Voxels, and COLLATE are special cases. The new algorithm is evaluated with simulations and shown to yield improved performance in cases with complex region difficulties. Multi-COLLATE achieve these results by capturing different consensus levels. The potential impacts and applications of generative model to label fusion problems are discussed.

Figure 2 Results from the controlled rater observation simulation. The target intensity image and the corresponding true labels can be seen (A) and (B), respectively. The set of input observations used in this experiment can be seen in (C). The results of the four algorithms, Majority Vote, STAPLE, Binary-COLLATE and Multi-COLLATE can be seen in (D)–(G). Note that the 3 consensus-level implementation of Multi-COLLATE is the only algorithm that gets the correct answer. The 3 energy in each of the consensus levels can be seen in (H)–(J). Note that by distributing the non-consensus voxels into two separate levels, (H) and (I), the estimation process converges to the correct answer.
Figure 2
Results from the controlled rater observation simulation. The target intensity image and the corresponding true labels can be seen (A) and (B), respectively. The set of input observations used in this experiment can be seen in (C). The results of the four algorithms, Majority Vote, STAPLE, Binary-COLLATE and Multi-COLLATE can be seen in (D)–(G). Note that the 3 consensus-level implementation of Multi-COLLATE is the only algorithm that gets the correct answer. The 3 energy in each of the consensus levels can be seen in (H)–(J). Note that by distributing the non-consensus voxels into two separate levels, (H) and (I), the estimation process converges to the correct answer.

Tags: ,