In this question, I follow the terminology and notation of the book of Watrous, most notably chapter two.
Extremal channels
An extremal channel $\Phi(X) \in C(\mathcal{X},\mathcal{Y})$ is a channel that can not be written as a convex combination of other channels. In other words, $\Phi$ is an extremal channel if and only if the equality \begin{equation} \Phi(X) = \lambda \Phi_{1}(X) + (1-\lambda)\Phi_{2}(X), \end{equation} for $\lambda \in (0,1)$ implies that $\Phi_{1} = \Phi_{2} = \Phi$.
Question
Given two extremal channels $\Lambda_{1} \in C(\mathcal{X},\mathcal{Y})$ and $\Lambda_{2} \in C(\mathcal{Y},\mathcal{Z})$, is the composition $\Lambda_{12} \in C(\mathcal{X},\mathcal{Z})$ defined by
$$ \Lambda_{12} = \Lambda_{2} \circ \Lambda_{1}, $$ also an extremal channel? Or are there extra criteria that the maps should adhere to for the composition to be an extremal channel, too?
In other words, suppose that $\Lambda_{12}(X) = \lambda\Lambda_{12}^{1}(X) + (1-\lambda)\Lambda_{12}^{2}(X)$, does that imply that $\Lambda_{12} = \Lambda_{12}^{1} = \Lambda_{12}^{2}$?.
Do $\Lambda_{12}^{1}$ and/or $\Lambda_{12}^{2}$ even necessarily need to have the structure of mapping $L(\mathcal{X})\rightarrow L(\mathcal{Y}) \rightarrow L(\mathcal{Z})$? (Insofar that this is not an ill-posed question).
Some thoughts
As is shown in the aforementiond book (Thm. $2.31$, page $96$), a channel represented by linearily independent Kraus operators $\{A_{k}\}_{k}$ is an extremal channel if and only if the collection $\{A_{k}^{\dagger}A_{k'}\}_{kk'}$ is also linearily independent.
Let $\{A_{k}: A_{k} \in L(\mathcal{X},\mathcal{Y})\}_{k}$ and $\{B_{l}\in L(\mathcal{Y},\mathcal{Z})\}_{l}$ be such Kraus operators for $\Phi_{1}$ and $\Phi_{2}$, respectively.
The collection $\{B_{l}A_{k}: B_{l}A_{k} \in L(\mathcal{X},\mathcal{Z})\}_{kl}$ forms a valid set of Kraus operators representing $\Phi$. However, these might not be linearily independent, which means that we cannot invoke the above criterium for extremal channels. One could take only a spanning but linearily independent subset and invoke the criterium, but I am not sure if this is a properly justified step. Even more so, such a set $\{A^{\dagger}_{k'}B^{\dagger}_{l'}B_{l}A_{k}\}_{(kl),(k'l')}$ is not necessarily linearily independent, I believe.
 
    