EDAboard.com | EDAboard.eu | EDAboard.de | EDAboard.co.uk | RTV forum PL | NewsGroups PL

speech recognition

Ask a question - edaboard.com

elektroda.net NewsGroups Forum Index - Electronics Design - speech recognition

Goto page 1, 2, 3  Next

RichD
Guest

Thu Jan 19, 2012 6:51 am   



http://www.sfgate.com/cgi-bin/article.cgi?f=/c/a/2012/01/16/BU8C1MOO20.DTL


He claims he can filter speech from background noise.

I recall discussing this possibility years ago. Someone said, these
filters already exist. They do - they're notch filters! It's close
to brain dead, believing that constitutes 'voice filtering'.

Dr. Watts has been working on this for years, so I was wondering
what techniques he's using, how much is public domain. Anyone
here know anything about the subject, or this product?
Is it neural nets, DSP filters, or what?


--
Rich

Richard Owlett
Guest

Thu Jan 19, 2012 1:28 pm   



RichD wrote:
Quote:
http://www.sfgate.com/cgi-bin/article.cgi?f=/c/a/2012/01/16/BU8C1MOO20.DTL


He claims he can filter speech from background noise.

I recall discussing this possibility years ago. Someone said, these
filters already exist. They do - they're notch filters! It's close
to brain dead, believing that constitutes 'voice filtering'.

If notch filter, why requirement for second microphone ;)


Quote:

Dr. Watts has been working on this for years, so I was wondering
what techniques he's using, how much is public domain. Anyone
here know anything about the subject, or this product?
Is it neural nets, DSP filters, or what?


--
Rich


Jan Panteltje
Guest

Thu Jan 19, 2012 1:38 pm   



On a sunny day (Wed, 18 Jan 2012 20:51:41 -0800 (PST)) it happened RichD
<r_delaney2001_at_yahoo.com> wrote in
<4b7edabe-1efe-4e39-9869-fcab7035b613_at_pt5g2000pbb.googlegroups.com>:

Quote:
http://www.sfgate.com/cgi-bin/article.cgi?f=/c/a/2012/01/16/BU8C1MOO20.DTL


He claims he can filter speech from background noise.

I recall discussing this possibility years ago. Someone said, these
filters already exist. They do - they're notch filters! It's close
to brain dead, believing that constitutes 'voice filtering'.

Dr. Watts has been working on this for years, so I was wondering
what techniques he's using, how much is public domain. Anyone
here know anything about the subject, or this product?
Is it neural nets, DSP filters, or what?

I have read about research like that long time ago,
and it was a new type of neural net where the communication
is not in spikes quantity, but used frequency modulated spikes (IIRC).
That was supposed to come close to how the real neurons worked
in some part of the cortex.
It was taken by DOD and used for submarine detection
and detection of where shots came from from snipers.
Then I could no longer find any publications about it on the net.
That system was employed in Iraq..
Could be the basis of things developed now.

Crossposting sucks, do not do it.

Capt. Cave Man
Guest

Thu Jan 19, 2012 2:29 pm   



On Thu, 19 Jan 2012 06:28:12 -0600, Richard Owlett <rowlett_at_pcnetinc.com>
wrote:

Quote:
RichD wrote:
http://www.sfgate.com/cgi-bin/article.cgi?f=/c/a/2012/01/16/BU8C1MOO20.DTL


He claims he can filter speech from background noise.

I recall discussing this possibility years ago. Someone said, these
filters already exist. They do - they're notch filters! It's close
to brain dead, believing that constitutes 'voice filtering'.

If notch filter, why requirement for second microphone ;)


Probably not a filter, but the second mike could be helpful in

establishing a baseline for what is 'noise' compared to what my be
originating from a point source.

Should be some "voice pattern" smarts capable of deciding what data IS
from a human source and segregating it from the rest.



Quote:

Dr. Watts has been working on this for years, so I was wondering
what techniques he's using, how much is public domain. Anyone
here know anything about the subject, or this product?
Is it neural nets, DSP filters, or what?


--
Rich

Goddamned overtly cross posting twit.

Phil Hobbs
Guest

Thu Jan 19, 2012 2:35 pm   



RichD wrote:
Quote:

http://www.sfgate.com/cgi-bin/article.cgi?f=/c/a/2012/01/16/BU8C1MOO20.DTL

He claims he can filter speech from background noise.

I recall discussing this possibility years ago. Someone said, these
filters already exist. They do - they're notch filters! It's close
to brain dead, believing that constitutes 'voice filtering'.

Dr. Watts has been working on this for years, so I was wondering
what techniques he's using, how much is public domain. Anyone
here know anything about the subject, or this product?
Is it neural nets, DSP filters, or what?

--
Rich

A friend of mine, Professor Dana Anderson of the University of Colorado,
Boulder, made a statistics-based digital filter that could separate
different kinds of music mixed together, as well as music from noise.
The demo was really striking--you mix together, say jazz and classical
music from two MP3 players, feed it through the gizmo, and after (iirc)
about 10 seconds of learning, classical comes out of one speaker and
jazz out of the other. Magic stuff--published in IEEE Acoustics around
2006, I think.

Cheers

Phil Hobbs
--
Dr Philip C D Hobbs
Principal Consultant
ElectroOptical Innovations LLC
Optics, Electro-optics, Photonics, Analog Electronics

160 North State Road #203
Briarcliff Manor NY 10510
845-480-2058

hobbs at electrooptical dot net
http://electrooptical.net

Jesse F. Hughes
Guest

Thu Jan 19, 2012 2:57 pm   



Phil Hobbs <pcdhSpamMeSenseless_at_electrooptical.net> writes:

Quote:
A friend of mine, Professor Dana Anderson of the University of Colorado,
Boulder, made a statistics-based digital filter that could separate
different kinds of music mixed together, as well as music from noise.
The demo was really striking--you mix together, say jazz and classical
music from two MP3 players, feed it through the gizmo, and after (iirc)
about 10 seconds of learning, classical comes out of one speaker and
jazz out of the other. Magic stuff--published in IEEE Acoustics around
2006, I think.

That sounds really impressive, if it works as well as you describe.

--
Jesse F. Hughes
"Mathematicians who read proofs of my results seem to basically lose
some part of themselves, like it rips at their souls, and they are no
longer quite right in the head." -- James S. Harris, Geek Cthulhu

David Bernier
Guest

Thu Jan 19, 2012 3:17 pm   



On Jan 18, 11:51 pm, RichD <r_delaney2...@yahoo.com> wrote:
Quote:
http://www.sfgate.com/cgi-bin/article.cgi?f=/c/a/2012/01/16/BU8C1MOO2....

He claims he can filter speech from background noise.

I recall discussing this possibility years ago.  Someone said, these
filters already exist.  They do - they're notch filters!  It's close
to brain dead, believing that constitutes 'voice filtering'.

Dr. Watts has been working on this for years, so I was wondering
what techniques he's using, how much is public domain.  Anyone
here know anything about the subject, or this  product?
Is it neural nets, DSP filters, or what?
[...]


There was an article:
"Advanced Noise Reduction for Mobile Telephony"
in IEEE Computer in 2008 by Lloyd Watts.
It's also at
< http://www.lloydwatts.com/ >
under "Publications"; maybe my browser has a
problem, as I can't download the 2008 paper
as a PDF file ...

Audience Inc. (which makes the chips) mentions
a proprietary “Fast Cochlea Transform™” (TM)
here:
< http://www.audience.com/technology/fast-cochlea.php >

At&T sells cell-phones with Audience chipsets.
I don't have any personal experience with
these kinds of advanced phones.

David Bernier

Graham Cooper
Guest

Thu Jan 19, 2012 3:24 pm   



On Jan 19, 11:17 pm, David Bernier <david53...@gmail.com> wrote:
Quote:
On Jan 18, 11:51 pm, RichD <r_delaney2...@yahoo.com> wrote:>http://www.sfgate.com/cgi-bin/article.cgi?f=/c/a/2012/01/16/BU8C1MOO2...

He claims he can filter speech from background noise.

I recall discussing this possibility years ago.  Someone said, these
filters already exist.  They do - they're notch filters!  It's close
to brain dead, believing that constitutes 'voice filtering'.

Dr. Watts has been working on this for years, so I was wondering
what techniques he's using, how much is public domain.  Anyone
here know anything about the subject, or this  product?
Is it neural nets, DSP filters, or what?

[...]

There was an article:
"Advanced Noise Reduction for Mobile Telephony"
in IEEE Computer in 2008 by Lloyd Watts.
It's also at
http://www.lloydwatts.com/
under "Publications"; maybe my browser has a
problem, as I can't download the 2008 paper
as a PDF file ...

Audience Inc. (which makes the chips) mentions
a proprietary “Fast Cochlea Transform™” (TM)
here:
http://www.audience.com/technology/fast-cochlea.php

At&T sells cell-phones with Audience chipsets.
I don't have any personal experience with
these kinds of advanced phones.

David Bernier



I had a 5 minute conversation with the Credit Card initiation bot.

Bottom up noise reduction is mathematics/electronics.

Top Down speech matching is AI. Generating likely responses for the
domain of the discussion topic.

The theory behind the former is 99% worked out, the theory behind the
latter is 1% worked out!

Herc

Eric Jacobsen
Guest

Thu Jan 19, 2012 4:58 pm   



On Thu, 19 Jan 2012 06:28:12 -0600, Richard Owlett
<rowlett_at_pcnetinc.com> wrote:

Quote:
RichD wrote:
http://www.sfgate.com/cgi-bin/article.cgi?f=/c/a/2012/01/16/BU8C1MOO20.DTL


He claims he can filter speech from background noise.

I recall discussing this possibility years ago. Someone said, these
filters already exist. They do - they're notch filters! It's close
to brain dead, believing that constitutes 'voice filtering'.

If notch filter, why requirement for second microphone Wink

Looks like maybe a combination of adaptive noise cancelling and maybe
a little bit of beamforming. Definitely not a notch filter.


Quote:
Dr. Watts has been working on this for years, so I was wondering
what techniques he's using, how much is public domain. Anyone
here know anything about the subject, or this product?
Is it neural nets, DSP filters, or what?


--
Rich

Eric Jacobsen
Anchor Hill Communications
www.anchorhill.com

Bitrex
Guest

Thu Jan 19, 2012 5:28 pm   



On 1/19/2012 8:35 AM, Phil Hobbs wrote:
Quote:
RichD wrote:

http://www.sfgate.com/cgi-bin/article.cgi?f=/c/a/2012/01/16/BU8C1MOO20.DTL

He claims he can filter speech from background noise.

I recall discussing this possibility years ago. Someone said, these
filters already exist. They do - they're notch filters! It's close
to brain dead, believing that constitutes 'voice filtering'.

Dr. Watts has been working on this for years, so I was wondering
what techniques he's using, how much is public domain. Anyone
here know anything about the subject, or this product?
Is it neural nets, DSP filters, or what?

--
Rich

A friend of mine, Professor Dana Anderson of the University of Colorado,
Boulder, made a statistics-based digital filter that could separate
different kinds of music mixed together, as well as music from noise.
The demo was really striking--you mix together, say jazz and classical
music from two MP3 players, feed it through the gizmo, and after (iirc)
about 10 seconds of learning, classical comes out of one speaker and
jazz out of the other. Magic stuff--published in IEEE Acoustics around
2006, I think.

Cheers

Phil Hobbs

Not exactly the same, but in a similar vein I have experimented with the
following software: http://www.celemony.com/cms/

It allows you to take say, a single track audio file of a complex
strummed guitar chord, separate out the chord into its constituent
notes, manipulate the frequencies of the constituent notes, and then
reassemble the whole thing back into a single audio file.

Curt Welch
Guest

Thu Jan 19, 2012 7:05 pm   



"Jesse F. Hughes" <jesse_at_phiwumbda.org> wrote:
Quote:
Phil Hobbs <pcdhSpamMeSenseless_at_electrooptical.net> writes:

A friend of mine, Professor Dana Anderson of the University of
Colorado, Boulder, made a statistics-based digital filter that could
separate different kinds of music mixed together, as well as music from
noise. The demo was really striking--you mix together, say jazz and
classical music from two MP3 players, feed it through the gizmo, and
after (iirc) about 10 seconds of learning, classical comes out of one
speaker and jazz out of the other. Magic stuff--published in IEEE
Acoustics around 2006, I think.

That sounds really impressive, if it works as well as you describe.

Here's a great little web demo of ICA - Independent Component Analyses. It
can separate sources mixed together when recorded in different
"microphones" (I assume the demo is just a mathematical mixing and not done
by recording).

http://research.ics.tkk.fi/ica/cocktail/cocktail_en.cgi

This approach makes the assumption that the source signals are linearly
mixed together at different levels in each microphone recording (due to the
different distances each source is away from the microphone) but can
separate as many different sources as you have microphones.

More info:

http://en.wikipedia.org/wiki/Independent_component_analysis

I would guess the telephone technology is using something similar since
they added a second microphone.

The only statistical requirement for this to work is that the sources must
have a non-Gaussian distribution.

BTW, this stuff is WAY past "notch filters" in complexity and power and
performance.

--
Curt Welch http://CurtWelch.Com/
curt_at_kcwc.com http://NewsReader.Com/

maury
Guest

Thu Jan 19, 2012 10:18 pm   



On Jan 18, 10:51 pm, RichD <r_delaney2...@yahoo.com> wrote:
Quote:
http://www.sfgate.com/cgi-bin/article.cgi?f=/c/a/2012/01/16/BU8C1MOO2....

He claims he can filter speech from background noise.

I recall discussing this possibility years ago.  Someone said, these
filters already exist.  They do - they're notch filters!  It's close
to brain dead, believing that constitutes 'voice filtering'.

Dr. Watts has been working on this for years, so I was wondering
what techniques he's using, how much is public domain.  Anyone
here know anything about the subject, or this  product?
Is it neural nets, DSP filters, or what?

--
Rich

The filter in his paper looks like the mel-frequency cepstrum
coefficients (MFCC) used in speech recognition, and multiplied by a
weighting )weighted MFCC perhaps). I found a paper by Jont Allen on
Cochlear modeling (ASSP Mag., Jan 1985). Maybe Watts is combining the
two concepts (thus the weighting). Additionally, Auience has a patent
(7,076,315) titled "Efficient computation of log-frequency-scale
digital filter cascade". MFCC is log-frequency-scale analysis, and
Watt's paper refers to a propritary bi-quad IIR filter. Hmmmmm

Just a thought.

Maurice Givens

Jamie
Guest

Fri Jan 20, 2012 12:10 am   



RichD wrote:

Quote:
http://www.sfgate.com/cgi-bin/article.cgi?f=/c/a/2012/01/16/BU8C1MOO20.DTL


He claims he can filter speech from background noise.

I recall discussing this possibility years ago. Someone said, these
filters already exist. They do - they're notch filters! It's close
to brain dead, believing that constitutes 'voice filtering'.

Dr. Watts has been working on this for years, so I was wondering
what techniques he's using, how much is public domain. Anyone
here know anything about the subject, or this product?
Is it neural nets, DSP filters, or what?


--
Rich
For what ever reason, I have never been able to get a speech to

text working here well enough with my voice to make it useable with
out detecting errors in miss use of words or at times, totally incorrect
words. But all of them seem to work well with woman voices from what
I've seen.

I understand the training cycle you need to perform in such tools to
build a profile for your voice.

The latest Dragon Speech does seem to work well however, it is not so
much just being able to correlate with my voice, it seems to have issues
deciding what is, as is and what is as CMDS.

The technology has come a long ways and I can remember the first one
I tried, which was for Windows 3.x and found it to work amazingly well
for such things back then. Even the speed response was good however, it
seems that as hardware speeds up, the software gets bloated
proportionally as they add more things, use newer tools that just puts
more un-wanted bloat in your code.

Maybe we'll stop adding layers to these tools one day.

Jamie

Phil Hobbs
Guest

Fri Jan 20, 2012 5:13 am   



HardySpicer wrote:
Quote:

On Jan 20, 2:35 am, Phil Hobbs
pcdhSpamMeSensel...@electrooptical.net> wrote:
RichD wrote:

http://www.sfgate.com/cgi-bin/article.cgi?f=/c/a/2012/01/16/BU8C1MOO2...

He claims he can filter speech from background noise.

I recall discussing this possibility years ago. Someone said, these
filters already exist. They do - they're notch filters! It's close
to brain dead, believing that constitutes 'voice filtering'.

Dr. Watts has been working on this for years, so I was wondering
what techniques he's using, how much is public domain. Anyone
here know anything about the subject, or this product?
Is it neural nets, DSP filters, or what?

--
Rich

A friend of mine, Professor Dana Anderson of the University of Colorado,
Boulder, made a statistics-based digital filter that could separate
different kinds of music mixed together, as well as music from noise.
The demo was really striking--you mix together, say jazz and classical
music from two MP3 players, feed it through the gizmo, and after (iirc)
about 10 seconds of learning, classical comes out of one speaker and
jazz out of the other. Magic stuff--published in IEEE Acoustics around
2006, I think.

Cheers

Phil Hobbs
--
Dr Philip C D Hobbs
Principal Consultant
ElectroOptical Innovations LLC
Optics, Electro-optics, Photonics, Analog Electronics

160 North State Road #203
Briarcliff Manor NY 10510
845-480-2058

hobbs at electrooptical dot nethttp://electrooptical.net

Yes but how are the two sources mixed in the first place? Acoustically
or pure computer addition? Important because one of those is
convolutive mixing.

hardy

"Convolutive mixing"? As in use one as a filter for the other? They
were summed, just as in every other audio mixer.

Cheers

Phil Hobbs
--
Dr Philip C D Hobbs
Principal Consultant
ElectroOptical Innovations LLC
Optics, Electro-optics, Photonics, Analog Electronics

160 North State Road #203
Briarcliff Manor NY 10510
845-480-2058

hobbs at electrooptical dot net
http://electrooptical.net

HardySpicer
Guest

Fri Jan 20, 2012 5:22 am   



On Jan 19, 5:51 pm, RichD <r_delaney2...@yahoo.com> wrote:
Quote:
http://www.sfgate.com/cgi-bin/article.cgi?f=/c/a/2012/01/16/BU8C1MOO2....

He claims he can filter speech from background noise.

I recall discussing this possibility years ago.  Someone said, these
filters already exist.  They do - they're notch filters!  It's close
to brain dead, believing that constitutes 'voice filtering'.

Dr. Watts has been working on this for years, so I was wondering
what techniques he's using, how much is public domain.  Anyone
here know anything about the subject, or this  product?
Is it neural nets, DSP filters, or what?

--
Rich

Couldn't say but a notch filter won't work if it's wideband noise with
spectral overlap.
Things that might work...adaptive beamforming, at a stretch Blind
Source Separation (doubt it though).


Hardy

Goto page 1, 2, 3  Next

elektroda.net NewsGroups Forum Index - Electronics Design - speech recognition

Ask a question - edaboard.com

Arabic versionBulgarian versionCatalan versionCzech versionDanish versionGerman versionGreek versionEnglish versionSpanish versionFinnish versionFrench versionHindi versionCroatian versionIndonesian versionItalian versionHebrew versionJapanese versionKorean versionLithuanian versionLatvian versionDutch versionNorwegian versionPolish versionPortuguese versionRomanian versionRussian versionSlovak versionSlovenian versionSerbian versionSwedish versionTagalog versionUkrainian versionVietnamese versionChinese version
RTV map EDAboard.com map News map EDAboard.eu map EDAboard.de map EDAboard.co.uk map Opony