Apr 29, 2010 2:51 AM

Web Semantics: speech in the cloud

*Now that there's a ton of voice traffic on the Internet, major-league search-engine techniques can be brought to bear on it.

*That's a little creepy.

http://radar.oreilly.com/2010/04/big-data-shakes-up-the-speech-industry.html

(...)

"Having speech technologies in the cloud lets Google quickly iterate and push enhanced speech engines on a regular basis. More importantly, their speech engines learn and get trained using real data from their many interconnected services. Speech engines typically rely on both language and acoustic models. Language models are statistical models of word sequences and patterns. Cohen pointed out that their language models use data collected from web searches, giving them access to an ever growing corpus that few can match (230 billion words collected, refined to a vocabulary of the million most common words).

(((So, what kind of mixmaster, guitar-pedal effects can one get out of a "spoken language model," one wonders. Like, if you had the comprehensive archive of all phone calls made to French cops, could you use one to call a French cop, even if you didn't speak French? Maybe you could just input a few factual parameters about the incident and have it generate French for you.)))

(((Then there are the obvious machine-generated phone-sex applications. I was writing sci-fi about those twenty years ago.)))

"Cohen disclosed that some of the more recent acoustic models they're evaluating are built using unsupervised machine-learning algorithms. (These are speech algorithms trained on recorded speech that haven't been transcribed by hand.) (((Oh dear.))) While he coyly avoided explaining how an accurate system can be built from unsupervised techniques, it's likely they use data from their 411 service....

Your Vape Wants to Know How Old You Are

Companies hope that biometric age-verification tech in cartridges could put flavored vapes back in business. But it's unlikely to solve the real problems.

Boone Ashworth

Exclusive LegalZoom Promo Code for 10% Off Services for April

Save on top services at LegalZoom, like LLC registration, incorporation, estate plans, and more with coupons and deals from WIRED.

Parker Hall

US Takes Down Botnets Used in Record-Breaking Cyberattacks

The Aisuru, Kimwolf, JackSkid, and Mossad botnets had infected more than 3 million devices in total, many inside home networks, according to the US Justice Department.

Andy Greenberg

Anduril Wants to Own the Future of War Tech. Mishaps, Delays, and Challenges Abound

From drones to missiles to submarines, the $30.5 billion defense startup wants to transform how the tools of war are made. It’s not all going as planned.

Paresh Dave

FCC Enforcement Chief Offered to Help Brendan Carr Target Disney, Records Show

Last year, as FCC chair Brendan Carr threatened ABC over a Jimmy Kimmel monolog, a civil servant overseeing West Coast stations privately pledged support, according to emails obtained by WIRED.

Dell Cameron

Opposing ICE Might Save the Country. It Could Also Ruin Your Life

For months, lone vibe coder Rafael Concepcion has obsessively built tools to counter the federal immigration crackdown—pivoting as he’s been outmatched. He’s also lost his job and become a target.

Brendan I. Koerner

Nobody Knows How to File Taxes on Prediction Market Wins

Americans flocked to prediction markets last year. Now, it’s time to pay taxes on winnings. How do you do that? Great question.

Kate Knibbs

Livestream Replay: The War Machine

A panel of WIRED experts dissected the defense tech industry’s impact on modern warfare.

Tim Marchman

Uncanny Valley: OpenAI and Musk Fight Again; DOJ Mishandles Voter Data; Artemis II Comes Home

In this episode, the hosts discuss the fight between OpenAI and Elon Musk, the misuse of voter data, and Artemis II’s moonshot.

Brian Barrett

The Trajectory of the Artemis II Moon Mission Is a Feat of Engineering

The astronauts will arrive about 10,300 kilometers beyond our satellite, breaking all previous records for distance from Earth. But how was their route chosen?

Luca Nardi

A Billionaire-Backed Startup Wants to Grow 'Organ Sacks' to Replace Animal Testing

R3 Bio has a bold idea for replacing lab animals: genetically-engineered whole organ systems that lack a brain. The long-term goal, says a cofounder, is to make human versions.

Emily Mullin

Firewire's Neutrino Looks Like an Ironing Board and Takes Off Like a Shot

Firewire makes the most innovative surfboards in the industry. This winter, I tried the Neutrino, Machado, and Revo Max to see if they're worth the hype.

Brent Rose