Soon We Will Be Able to Design Custom Sounds With Voice and Gesture

The first thing an architect or graphic designer will do at the start of a project is to produce some preliminary sketches — just to rough out their ideas on paper, perhaps augmented with computer-aided design software. But sound designers don’t have similar tools. A consortium of European researchers is seeking to change that by developing a suite of sketching tools for sound, based on voice and gestures.

“If you are an architect and want to sketch a house, you can simply draw it on a sketchpad,” the researchers wrote in a summary of their work. “But what do you do if you are a sound designer and want to rapidly sketch the sound of a new motorbike?” The usual tools (synthesizers, samplers, and sequencers, for instance) are complicated and require considerable training to use. They’re just not as simple, quick, and intuitive as a sketchpad.

Sound is difficult to describe in words, which is why most of us resort to a combination of gesture and vocal mimicry when, say, trying to convey to someone else that a car goes vrooom. The human voice is like a built-in sound synthesizer.

“People can recognize fairly well what a person imitates,” Guillaume Lemaitre, a researcher at Ircam in Paris, France, told Gizmodo via email. “So our dream tool would be a synthesizer that we could directly interact with, [using] our voice and gestures, just as what we do naturally when we talk to someone. Ideally, this synthesizer would understand the imitations the same way a person would do, and create sounds accordingly.”

That’s the goal of SkAT-VG (Sketching Audio Technologies with Voice and Gestures), a three-year interdisciplinary collaborative project between four partners. Ircam is responsible for aspects involving perception psychology, gesture analysis, signal processing, and machine learning. The Royal Institute of Technology (KTH) in Stockholm, Sweden, is handling the phonetics, while Iuav University of Venice, Italy, focuses on sound design and sound synthesis. And Genesis, a company based in Aix-en-Provence that conducts sound studies and develops audio technologies for sound design, is in charge of user studies and prototype integration.

The first step is gaining a better understanding of how people use mimicry and gesture to communicate different sounds. So Lemaitre and his Ircam colleagues rounded up 50 volunteers and had them listen to recorded sounds, then imitate those sounds. There were mechanical sounds (like tapping and scraping), sounds of common objects (cars, blenders, and saws), and computer sounds, like the sound effects in video games. All the participants were filmed with a GoPro camera, recorded with a body-tracking Kinect sensor, and fitted with accelerometers attached to their wrists. The researchers also captured the process on video.

Lemaitre admits that they had some misconceptions going into the study. For instance, “We initially thought that people would draw the trajectory of some acoustical features — like pitch or the intensity — with their hands in the air, like raising your hand to imitate pitch going up,” he said. But this proved not to be the case. Instead, gestures were used more for emphasis, in a metaphorical fashion stereotypically associated with Italian characters in film and television. “They seemed to be more like symbols that indicate certain overall properties of the sounds,” Lemaitre said.

Based on that, he and his colleagues concluded that gestures would not be particularly useful as a means of precisely controlling the behavior of a synthesizer in real time, as the consortium members originally thought would be possible. Vocal imitations are far more effective for that purpose. “Voice can reproduce accurately higher tempos than gestures, and is more precise than gestures when reproducing complex rhythmic patterns,” according to Lemaitre’s summary.

The next step is to build actual prototypes of the sketching tools, based on what’s been learned so far, and test how well they work in real-world conditions. Lemaitre said the consortium will hold a special event this spring in the south of France, specifically for sound designers, giving them the task of creating specific sounds with the prototype tools and evaluating the pros and cons of the prototypes.

Practical uses aside, Lemaitre thinks studies of vocal imitations and gestures might also prove beneficial for neuroscientists interested in auditory perception and cognition. Studies like the one above could improve our understanding of how sounds are encoded in memory.

Reference:

Rocchesso, D., Lemaitre, G., Susini, P., Ternström, S., & Boussard, P. (2015) “Sketching Sound with Voice and Gesture,” Interactions 22(1): 38-41.

[Via Acoustical Society of America]

