About Us

Yamagishi Lab

  • TOP
  • Yamagishi Lab

Message

Acoustic Research
and Technology Development that Benefits Society

Our laboratory is more than just a venue for acoustic research.
We are an acoustics laboratory that evolves together with society. We are committed to mid- and long-term scientific exploration of sound and developing advanced technologies that address the complex acoustics-related challenges and societal needs facing Japan.
Our vision is to contribute to the betterment of Japan's future through cutting-edge sound research. Aside from presenting academic papers at top conferences, we also collaborate with Japanese companies on initiatives to translate the fruits of our research into concrete applications for society.
By publishing research data sets and undertaking projects that pioneer innovation, we help drive the advancement of Japanese university acoustics research that paves the way for the future.

Creating research and technology that transforms society

Mission

Contributing to Society Through Acoustic Research
and State-of-the-Art Technology

Our laboratory is committed to its mission to resolve complex sound-related issues affecting Japanese society through academic research and advanced acoustic technologies.
We continuously endeavor to contribute to society, pioneering efforts to tackle issues that cannot be addressed by conventional speech information processing
and develop scientific and technological solutions to address them. Some of the problems we have focused on include deep fakes,
voice cloning, and privacy implications associated with large-scale speech data used to train generative models.
We also provide support for Japanese university research by publishing research data sets and organizing academic challenges as an Inter-University Research Institutional Corporation.

Vision

Acoustic Research
that Paves the Way to the Future and Giving Back to Society

At the National Institute of Informatics's Yamagishi Lab (a national research institute), our vision is to conduct acoustic research and develop state-of-the-art technology rooted in society's mid- and long-term needs.
These R&D initiatives are intended to give back to society by producing seed technologies to better Japan's future and enhance the technological capabilities of Japanese universities and companies.
We aim to be an acoustic research laboratory that evolves together with society for the future of Japan.

Research Environment

Research Environment

The National Institute of Informatics' Yamagishi Lab, located in Hitotsubashi, Tokyo, provides a suitable environment for conducting a wide range of research on sound and language.
The laboratory provides the ideal space to conduct a wide range of research on sound and language, from acoustic experiments, machine-learning research on sound generation and identification models based on large-scale data, and applied science research to solve societal issues such as combating deepfakes.

  • Research Fellows
  • Research Fellows
  • Research Fellows
  • Research Fellows
  • Research Fellows
  • Research Fellows
  • Research Fellows
  • Research Fellows
  • Research Fellows
  • Research Fellows
  • Research Fellows
  • Research Fellows
  • Research Fellows
  • Research Fellows
  • Research Fellows
  • Research Fellows
  • Research Fellows
  • Research Fellows
  • Research Fellows
  • Research Fellows

for Companies

Yamagishi Lab
specializes in research on acoustics and linguistics.

  1. 01

    Sound generation and conversion research

    Speech Generation and Transformation

    Our research includes text-to-speech technology that automatically synthesizes natural sounds from textual information in various languages, and sound generation models that automatically generate natural instrumental sounds from musical score information such as MIDI. We also conduct research on integrating speech signal processing and machine learning, voice conversion techniques such as intelligibility enhancement to improve the clarity of voices in crowded places, and making computers capable of understanding sound quality as heard by humans.

  2. 02

    Research on automated speaker recognition from speech

    Speaker Recognition from Speech

    Speech carries information beyond language. It can also help computers understand non-verbal details such as who the speaker is, their gender, dialect, emotions, health status, among others. Our laboratory is also working on research concerning speaker recognition (voice biometric technology). These efforts aim to improve the matching accuracy of speaker recognition and system robustness against noise, as well as develop technology to guard against voice impersonation.

  3. 03

    Research on Discerning Between AI-synthesized Speech
    and Human Speech.

    Distinguishing AI-Generated Speech from Human Speech

    Generative AI can automatically generate audio and video media remarkably similar to the real thing. While this technology is expected to be useful for various business applications, its misuse for malicious purposes may cause serious social harm. Our laboratory is working on how to accurately and automatically detect deepfakes to prevent the proliferation of misinformation.

  4. 04

    Voice Privacy Research,
    Such as Those on Voice Anonymization.

    Voice Privacy and Anonymization

    With the current ease of identifying individuals from audio on the Web and the creation of deepfake audio through voice cloning, speaker anonymization technology is becoming essential to protect the speaker's privacy by modifying speech data before releasing it to the public. Our laboratory is conducting research to transform voices so that they sound natural while keeping the speaker anonymous. This research aims to create technologies that can be used for broadcasting.

Social applications ArrowArrow

Joint Research

Collaborations Between Corporations and Yamagishi Lab

The National Institute of Informatics' Yamagishi Laboratory can work with businesses in three ways: joint research, technology provision (patent and software licensing), and technical support and advisory services.

Three Key Advantages of Collaborative Research

  • Presentations at Top International Conferences

    ( 01 )

    Presentations at Top International Conferences

    The findings of collaborative research with Yamagishi Lab can be presented at top international conferences and journal papers on speech information processing.

  • Global Technology Recognition

    ( 02 )

    Global Technology Recognition

    Many of these speech information processing conferences are also attended by Big Tech, providing an opportunity for our research and technological achievements to gain worldwide recognition.

  • Enhancing the Research Expertise of Corporate Researchers

    ( 03 )

    Enhancing the Research Expertise of Corporate Researchers

    We provide a range of support and advice via collaborative research to studies being conducted by corporate researchers. This aid helps grow the skills of these corporate researchers.

Collaboration Project Members

Depending on the size of the company's budget,
we can assign researchers from Yamagishi Lab to assist as collaborative project members to supplement our support and guidance.

Lab Members ArrowArrow
Lab Members

Providing Technology and Technical Support

  • Patents and Software Licenses

    Patents and Software Licenses

    We offer our technology to companies
    that want to use our research in their business.

  • Technical Support and Advisory

    Technical Support and Advisory

    Using our scientific and technical expertise
    to help companies overcome problems.

List of Results ArrowArrow

Achievements

Track Record of Collaboration with Companies

  • Collaborative Research

    Fujitsu Limited

    Fujitsu, along with nine industry-academic organizations, is working on a collaborative research project aimed at evaluating audio and video content to determine whether they are deepfakes. Information obtained from these sources, along with other information, will then be used to counter the spread of misinformation.

  • Collaborative Research

    DataGrid Inc.

    We developed a deepfake detection model for Japanese audio as part of a joint initiative with DATAGRID Inc. to create technology for countermeasures against false and misleading information on the Internet.

  • Provided Technologies

    Japan Broadcasting Corporation (NHK)

    Speaker anonymization technology developed by Yamagishi Lab is being used by NHK (Japan Broadcasting Corporation) in their TV broadcasts.

    Provided Technologies

    CyberAgent, Inc.

    We provided CyberAgent, Inc. with SYNTHETIQ VISION, a program that automatically determines the authenticity of AI-generated faces.

  • Technical Guidance

    HOYA Corporation Readspeaker Division

    Using Yamagishi Lab's speech intelligibility enhancement technology, we improved the ReadSpeaker text-to-speech system for Tokaido Shinkansen station broadcasts and provided technical support on enhancing speech intelligibility in noisy environments.

  • Technical Advisor

    Yamagishi’s Side Job

    ORENDA WORLD Co., Ltd.

    Dr. Yamagishi works as an advisor to various voice AI projects conducted by ORENDA WORLD, Inc.