Follow Us

We use cookies to provide you with a better experience. If you continue to use this site, we'll assume you're happy with this. Alternatively, click here to find out how to manage these cookies

hide cookie message

Alarming synthesized speech tells you when to freak out

Fujitsu technology could work with cloud data to tailor synthesized speech to a given situation

Article comments

Fujitsu has developed technology that makes synthesized speech sound a little more natural and less robotic by adopting the appropriate tone for different situations.

For instance, if there's an emergency it would select an alarming tone of voice. In a noisy environment, it would choose to speak louder and more clearly. A tranquil environment might elicit a relaxing tone.

To create the voice, the technology uses machine learning algorithms to analyze the patterns of natural speech and extract voice characteristics relatively quickly from a small voice sample. From those characteristics, a high-quality synthetic voice can be built and imbued with the right tone for a given situation, Fujitsu said.

The synthesis technology could work with local or cloud data so that a voice system could give out information with the appropriate tone and pitch. It might inform factory workers of a valve problem, for example, by adopting an urgent tone of voice.

An audio sample in Japanese presents a synthesized voice repeating a warning about a duct error with increasing urgency, rising in pitch and speed.

In a country like Japan, where everything from escalators to trucks issue automated voice warnings to users or people nearby, making synthetic voice systems aware of environmental and other data could make them more effective. But the technology could be used wherever synthetic voices are needed.

"Some examples of potential applications for this technology are voice-based work support solutions in factories and other work environments, natural disaster-related broadcast solutions, car navigation systems, text-to-speech services for online content, text-to-speech email solutions and automated messaging services," a Fujitsu spokesman said.

The know-how might also lead to applications that clone a person's natural voice, which would be useful for those who are losing their ability to speak due to illness.

It would take about three to five hours of someone reading out a text to create a database, the spokesman said, after which the person's voice could be cloned. If the user were to lose the ability to speak, a PC could speak typed words with the cloned voice.

Before he died in 2013, movie critic Roger Ebert, who had lost his voice to cancer, used a system developed by Scottish text-to-speech company CereProc to recreate his voice based on DVD film commentaries he had recorded.

Fujitsu plans to commercialize its voice synthesis technology in its fiscal year that ends in March 2015.

It is also looking at the possibility of expanding the know-how to be used as a cloud service, the spokesman added.



Share:

More from Techworld

More relevant IT news

Comments



Send to a friend

Email this article to a friend or colleague:

PLEASE NOTE: Your name is used only to let the recipient know who sent the story, and in case of transmission error. Both your name and the recipient's name and address will not be used for any other purpose.

Techworld White Papers

Choose – and Choose Wisely – the Right MSP for Your SMB

End users need a technology partner that provides transparency, enables productivity, delivers...

Download Whitepaper

10 Effective Habits of Indispensable IT Departments

It’s no secret that responsibilities are growing while budgets continue to shrink. Download this...

Download Whitepaper

Gartner Magic Quadrant for Enterprise Information Archiving

Enterprise information archiving is contributing to organisational needs for e-discovery and...

Download Whitepaper

Advancing the state of virtualised backups

Dell Software’s vRanger is a veteran of the virtualisation specific backup market. It was the...

Download Whitepaper

Techworld UK - Technology - Business

Innovation, productivity, agility and profit

Watch this on demand webinar which explores IT innovation, managed print services and business agility.

Techworld Mobile Site

Access Techworld's content on the move

Get the latest news, product reviews and downloads on your mobile device with Techworld's mobile site.

Find out more...

From Wow to How : Making mobile and cloud work for you

On demand Biztech Briefing - Learn how to effectively deliver mobile work styles and cloud services together.

Watch now...

Site Map

* *