ESEIG - UTC Física e Eletrónica
Permanent URI for this community
Browse
Browsing ESEIG - UTC Física e Eletrónica by Title
Now showing 1 - 10 of 13
Results Per Page
Sort Options
- Adaptive filtering for high quality HMM based speech synthesisPublication . Coelho, Luís; Braga, DanielaIn this work an adaptive filtering scheme based on a dual Discrete Kalman Filtering (DKF) is proposed for Hidden Markov Model (HMM) based speech synthesis quality enhancement. The objective is to improve signal smoothness across HMMs and their related states and to reduce artifacts due to acoustic model's limitations. Both speech and artifacts are modelled by an autoregressive structure which provides an underlying time frame dependency and improves time-frequency resolution. Themodel parameters are arranged to obtain a combined state-space model and are also used to calculate instantaneous power spectral density estimates. The quality enhancement is performed by a dual discrete Kalman filter that simultaneously gives estimates for the models and the signals. The system's performance has been evaluated using mean opinion score tests and the proposed technique has led to improved results.
- Adaptive modeling and high quality spectral estimation for speech enhancementPublication . Coelho, Luis; Braga, DanielaIn this work an adaptive modeling and spectral estimation scheme based on a dual Discrete Kalman Filtering (DKF) is proposed for speech enhancement. Both speech and noise signals are modeled by an autoregressive structure which provides an underlying time frame dependency and improves time-frequency resolution. The model parameters are arranged to obtain a combined state-space model and are also used to calculate instantaneous power spectral density estimates. The speech enhancement is performed by a dual discrete Kalman filter that simultaneously gives estimates for the models and the signals. This approach is particularly useful as a pre-processing module for parametric based speech recognition systems that rely on spectral time dependent models. The system performance has been evaluated by a set of human listeners and by spectral distances. In both cases the use of this pre-processing module has led to improved results.
- An automatic voice pleasantness classification system based on prosodic and acoustic patterns of voice preferencePublication . Coelho, Luis; Braga, Daniela; Sales-Dias, Miguel; Garcia-Mateo, CarmenIn the last few years the number of systems and devices that use voice based interaction has grown significantly. For a continued use of these systems the interface must be reliable and pleasant in order to provide an optimal user experience. However there are currently very few studies that try to evaluate how good is a voice when the application is a speech based interface. In this paper we present a new automatic voice pleasantness classification system based on prosodic and acoustic patterns of voice preference. Our study is based on a multi-language database composed by female voices. In the objective performance evaluation the system achieved a 7.3% error rate.
- Automatic syllabification for danish text-to-speech systemsPublication . Beck, Jeppe; Braga, Daniela; Nogueira, João; Sales-Dias, Miguel; Coelho, LuísIn this paper, a rule-based automatic syllabifier for Danish is described using the Maximal Onset Principle. Prior success rates of rule-based methods applied to Portuguese and Catalan syllabification modules were on the basis of this work. The system was implemented and tested using a very small set of rules. The results gave rise to 96.9% and 98.7% of word accuracy rate, contrary to our initial expectations, being Danish a language with a complex syllabic structure and thus difficult to be rule-driven. Comparison with data-driven syllabification system using artificial neural networks showed a higher accuracy rate of the former system.
- CardioML: integrating personal cardiac information for ubiquous diagnosis and analysisPublication . Coelho, Luis; Queirós, RicardoThe latest medical diagnosis devices enable the performance of e-diagnosis making the access to these services easier, faster and available in remote areas. However this imposes new communications and data interchange challenges. In this paper a new XML based format for storing cardiac signals and related information is presented. The proposed structure encompasses data acquisition devices, patient information, data description, pathological diagnosis and waveform annotation. When compared with similar purpose formats several advantages arise. Besides the full integrated data model it may also be noted the available geographical references for e-diagnosis, the multi stream data description, the ability to handle several simultaneous devices, the possibility of independent waveform annotation and a HL7 compliant structure for common contents. These features represent an enhanced integration with existent systems and an improved flexibility for cardiac data representation.
- Effective swimmer’s action during the grab start techniquePublication . Mourão, Luis; Jesus, Karla de; Roesler, Hélio; Machado, Leandro J.; Fernandes, Ricardo J.; Vilas-Boas, João Paulo; Vaz, Mário A. P.The external forces applied in swimming starts have been often studied, but using direct analysis and simple interpretation data processes. This study aimed to develop a tool for vertical and horizontal force assessment based on the swimmers' propulsive and structural forces (passive forces due to dead weight) applied during the block phase. Four methodological pathways were followed: the experimented fall of a rigid body, the swimmers' inertia effect, the development of a mathematical model to describe the outcome of the rigid body fall and its generalization to include the effects of the inertia, and the experimental swimmers' starting protocol analysed with the inclusion of the developed mathematical tool. The first three methodological steps resulted in the description and computation of the passive force components. At the fourth step, six well-trained swimmers performed three 15 m maximal grab start trials and three-dimensional (3D) kinetic data were obtained using a six degrees of freedom force plate. The passive force contribution to the start performance obtained from the model was subtracted from the experimental force due to the swimmers resulting in the swimmers' active forces. As expected, the swimmers' vertical and horizontal active forces accounted for the maximum variability contribution of the experimental forces. It was found that the active force profile for the vertical and horizontal components resembled one another. These findings should be considered in clarifying the active swimmers' force variability and the respective geometrical profile as indicators to redefine steering strategies.
- ezGo: A voice operated wheelchair with biosignal monitoring for home environmentsPublication . Coelho, Luis; Braga, DanielaIn this paper we present ezGo, an electric powered wheelchair with a speech based interface and biosignal monitoring instrumentation. The user can use the voice, a natural communication method, for controlling the chair movement and obtain information about his health. Additionally a set of semi-autonomous modes with macro recording enable the execution of navigation tasks with little effort and improved precision. The main purpose of the system is to provide severely disabled persons with an assistive device that can improve their confidence and daily independence. The obtained results on usability tests showed that users consider ezGo a valuable help on their daily tasks and a very desirable addition to standard wheelchairs.
- Homograph ambiguity resolution in front-end design for portuguese TTS systemsPublication . Braga, Daniela; Coelho, Luis; Resende Jr., Fernando Gil V.In this paper, a module for homograph disambiguation in Portuguese Text-to-Speech (TTS) is proposed. This module works with a part-of-speech (POS) parser, used to disambiguate homographs that belong to different parts-of-speech, and a semantic analyzer, used to disambiguate homographs which belong to the same part-of-speech. The proposed algorithms are meant to solve a significant part of homograph ambiguity in European Portuguese (EP) (106 homograph pairs so far). This system is ready to be integrated in a Letter-to-Sound (LTS) converter. The algorithms were trained and tested with different corpora. The obtained experimental results gave rise to 97.8% of accuracy rate. This methodology is also valid for Brazilian Portuguese (BP), since 95 homographs pairs are exactly the same as in EP. A comparison with a probabilistic approach was also done and results were discussed.
- Kalman tracking linear predictor for vowel intelligibility enhancement on european portuguese HMM based speech synthesisPublication . Coelho, Luís; Braga, Daniela; Garcia-Mateo, CarmenThe recent developments on Hidden Markov Models (HMM) based speech synthesis showed that this is a promising technology fully capable of competing with other established techniques. However some issues still lack a solution. Several authors report an over-smoothing phenomenon on both time and frequencies which decreases naturalness and sometimes intelligibility. In this work we present a new vowel intelligibility enhancement algorithm that uses a discrete Kalman filter (DKF) for tracking frame based parameters. The inter-frame correlations are modelled by an autoregressive structure which provides an underlying time frame dependency and can improve time-frequency resolution. The system’s performance has been evaluated using objective and subjective tests and the proposed methodology has led to improved results.
- Massive Open Online Course management: learning science and engineering through peer-reviewed projectsPublication . Pessoa, Ana S. Moura; Coelho, Luís; Fernandes, RubenMassive Open Online Courses (MOOC) are gaining prominence in transversal teaching-learning strategies. However, there are many issues still debated, namely assessment, recognized largely as a cornerstone in Education. The large number of students involved requires a redefinition of strategies that often use approaches based on tasks or challenging projects. In these conditions and due to this approach, assessment is made through peer-reviewed assignments and quizzes online. The peer-reviewed assignments are often based upon sample answers or topics, which guide the student in the task of evaluating peers. This chapter analyzes the grading and evaluation in MOOCs, especially in science and engineering courses, within the context of education and grading methodologies and discusses possible perspectives to pursue grading quality in massive e-learning courses.