I have been digging into the past and found some material from the 1990’s when I was part of a European Union-funded international group that produced a corpus of spoken Bulgarian, Estonian, Hungarian, Polish and Romanian; the design of the corpus was based on the SAM project for EU languages. Our project was given the rather over-used name BABEL ( I remember having the name foisted on us by an EU advisor at a planning meeting in Luxembourg). I have written a little Wikipedia article called The BABEL Speech Corpus to commemorate it.
A blog that discusses problems in Wikipedia's coverage of Phonetics
Emeritus Professor of Phonetics