Speech recognition. Building word-level HMM from phone-level HMMs. Transtion matrix

Asked Feb 27 '25 at 21:04

Active Feb 27 '25 at 21:05

Viewed 43 times

I am implementing my HMM-GMM speech recognition model.

Right now I am facing a problem described below.

Given phone-level HMMs A and B, build word-level HMM C. In this questions lets assume that according to lexicon file I need to make C from A and B where A is followed by B. Is it a common practice?

States of HMM A: a1, a2, a3

States of HMM B: b1, b2, b3

Let transition matrices for A and B be as follows:

As far as I understand C has states merged from A and B.

So states for HMM C: a1, a2, a3, b1, b2, b3.

But what about transition matrix?

But this doesnt seem like a legit solution.

What is the algorithm of concatination of such matrices? Or perhaps I am missing something. Link to a good article is highly appreciated.

edited Feb 27 '25 at 21:05

asked Feb 27 '25 at 21:04

ASR

Speech recognition. Building word-level HMM from phone-level HMMs. Transtion matrix

0 Answers0