Reconhecimento de fala: mudanças entre as edições

Edição das 15h12min de 24 de dezembro de 2004

Tecnologias de reconhecimento de voz permitem que computadores equipados com microfones interpretem a fala humana , por exemplo, para transcrição ou como método de comando por voz. Tais sistemas podem ser classificados por requererem ou não que o usuário treine o sistema a reconhecer seu padrões particulares de fala,por ter a habilidade de reconhecer fala contínua ou por requerer quer o usuário fale pausadamente. e pelo tamanho do vocabulário que é capaz de reconhecer(pequeno, da ordem de dezenas a centenas de palavras, ou grande, com milhares de palavras).

Systems requiring a short amount of training can (as of 2001) capture continuous speech with a large vocabulary at normal pace with an accuracy of about 98% (getting two words in one hundred wrong), and different systems that require no training can recognize a small number of words (for instance, the ten digits of the decimal system) as spoken by most English speakers. Such systems are popular for routing incoming phone calls to their destinations in large organisations.

Commercial systems for speech recognition have been available off-the-shelf since the 1990s. However, it is interesting to note that despite the apparent success of the technology, few people use such speech recognition systems.

It appears that most computer users can create and edit documents more quickly with a conventional keyboard, despite the fact that most people are able to speak considerably faster than they can type. Additionally, heavy use of the speech organs results in vocal loading.

Some of the key technical problems in speech recognition are that:

Inter-speaker differences are often large and difficult to account for. It is not clear which characteristics of speech are speaker-independent.
The interpretation of many phonemes, words and phrases are context sensitive. For example, phonemes are often shorter in long words than in short words. Words have different meanings in different sentences, e.g. "Philip lies" could be interpreted either as Philip being a liar, or that Philip is lying on a bed.
Intonation and speech timbre can completely change the correct interpretation of a word or sentence, e.g. "Go!", "Go?" and "Go." can clearly be recognised by a human, but not so easily by a computer.
Words and sentences can have several valid interpretations such that the speaker leaves the choice of the correct one to the listener.
Written language may need punctuation according to strict rules that are not strongly present in speech, and are difficult to infer without knowing the meaning (commas, ending of sentences, quotations).

The "understanding" of the meaning of spoken words is regarded by some as a separate field, that of natural language understanding. However, there are many examples of sentences that sound the same, but can only be disambiguated by an appeal to context: one famous T-shirt worn by Apple Computer researchers stated,

I helped Apple wreck a nice beach,

which, when spoken, sounds like I helped Apple recognize speech.

A general solution of many of the above problems effectively requires human knowledge and experience, and would thus require advanced artificial intelligence technologies to be implemented on a computer. In particular, statistical language models are often employed for disambiguation and improvement of the recognition accuracies.

Veja também

Processamento de Sinais de Áudio
- Processamento de Fala
- Síntese de Voz (o oposto de reconhecimento de voz)
Lingüística Computacional
Processamento Digital de sinais
Dynamic time warping
Hidden Markov models
Lingüística
Mondegreen
Mel Frequency Cepstral Coefficients (MFCCs)
Reconhecimento de Padrões
ViaVoice
Análise de Voz
Dispositivo de Comando de Voz

External links

Statistical Language Modeling (Natural Language Processing Lab, Northeastern University, China)

da:Talegenkendelse de:Spracherkennung en:Speech recognition es:Comprensión del lenguaje fi:Puheentunnistus fr:Reconnaissance vocale ja:音声認識 nl:Spraakherkenning

@@ Linha 1: / Linha 1: @@
-[[da:Talegenkendelse]]
-[[de:Spracherkennung]]
-[[es:Comprensión del lenguaje]]
-[[fr:Reconnaissance vocale]]
-[[ja:&#38899;&#22768;&#35469;&#35672;]]
-[[fi:Puheentunnistus]]
-[[en:Speech recognition]]
 [[Categoria:Inteligência Artificial]]
@@ Linha 53: / Linha 45: @@
 == External links ==
 * [http://www.nlplab.cn/zhangle/slm.html Statistical Language Modeling (Natural Language Processing Lab, Northeastern University, China)]
+[[da:Talegenkendelse]]
+[[de:Spracherkennung]]
+[[en:Speech recognition]]
+[[es:Comprensión del lenguaje]]
+[[fi:Puheentunnistus]]
+[[fr:Reconnaissance vocale]]
+[[ja:音声認識]]
+[[nl:Spraakherkenning]]

Reconhecimento de fala: mudanças entre as edições

Edição das 15h12min de 24 de dezembro de 2004

Veja também

External links

O que estudar para o enem 2023

Qual melhor curso para fazer em 2023

Enem: Conteúdos E Aulas On-Line São Opção Para Os Estudantes

Como Fazer Uma Carta De Apresentação

Como Escrever Uma Boa Redação

Concurso INSS edital 2022 publicado

ARTIGOS DE TENDÊNCIA

Resultado do Enem 2023: Saí nesta terça-feira

Concurso Unificado: inscrições serão aceitas pelo GOV.BR

Como fazer uma redação passo a passo para concurso

Permaneça conectado

Parceiros

Reconhecimento de fala: mudanças entre as edições

Edição das 15h12min de 24 de dezembro de 2004

Veja também

External links

talvez você goste

Assine

ARTIGOS DE TENDÊNCIA

Permaneça conectado

Facebook

Parceiros