大規模な日本語話し言葉データベースを用いた講演音声認識

南條, 浩輝; 加藤, 一臣; 李, 晃伸; 河原, 達也

WEKO3

lat lon distance

[[sub_check.contents]]

[[sub_radio.contents]]

Field does not validate

[[sub_attr.contents]]　

インデックスツリー

アイテム

{"_buckets": {"deposit": "9e5e7c4c-a817-4d8b-ae98-9b83545d54bd"}, "_deposit": {"created_by": 2, "id": "3797", "owners": [2], "pid": {"revision_id": 0, "type": "depid", "value": "3797"}, "status": "published"}, "_oai": {"id": "oai:naist.repo.nii.ac.jp:00003797", "sets": ["35"]}, "author_link": ["6201", "6202", "6199", "6200"], "item_7_alternative_title_1": {"attribute_name": "その他のタイトル", "attribute_value_mlt": [{"subitem_alternative_title": "Lecture Speech Recognition Using Large Corpus of Spontaneous Japanese"}]}, "item_7_biblio_info_9": {"attribute_name": "書誌情報", "attribute_value_mlt": [{"bibliographicIssueDates": {"bibliographicIssueDate": "2003-04", "bibliographicIssueDateType": "Issued"}, "bibliographicIssueNumber": "4", "bibliographicPageEnd": "459", "bibliographicPageStart": "450", "bibliographicVolumeNumber": "J86-D-II", "bibliographic_titles": [{"bibliographic_title": "電子情報通信学会論文誌D-II"}]}]}, "item_7_description_19": {"attribute_name": "フォーマット", "attribute_value_mlt": [{"subitem_description": "application/pdf", "subitem_description_type": "Other"}]}, "item_7_description_7": {"attribute_name": "抄録", "attribute_value_mlt": [{"subitem_description": "開放的融合研究「話し言葉工学」プロジェクトにおいて構築されている日本語話し言葉コーパスを用いて講演音声の認識を行った.話し言葉は書き言葉の読上げ音声と大きく性質が異なるため,それに合致したモデル化と認識手法の検討が必要となる.音響モデルについては発話スタイルとデータ量の影響を調べた.言語モデルについては,話し言葉コーパスのデータ量不足を補うために他のコーパスと混合する方法,特に混合重みの最適化手法を提案する.また認識に際して,事前の発話のセグメンテーションが容易でないため,ショートポーズの自動認識に基づいて区分化と認識結果の確定を行う逐次デコーディング方式を提案・実装した.10名の話者による講演音声の認識実験で提案手法の有効性を示し,平均66.2%の認識率を得た.", "subitem_description_type": "Abstract"}]}, "item_7_publisher_10": {"attribute_name": "出版者", "attribute_value_mlt": [{"subitem_publisher": "電子情報通信学会"}]}, "item_7_rights_11": {"attribute_name": "出版者URL", "attribute_value_mlt": [{"subitem_rights": "http://ci.nii.ac.jp/naid/110003170907 | http://ci.nii.ac.jp/naid/110003170907"}]}, "item_7_rights_18": {"attribute_name": "権利", "attribute_value_mlt": [{"subitem_rights": "Copyright (C) 2003 電子情報通信学会."}]}, "item_7_source_id_12": {"attribute_name": "ISSN", "attribute_value_mlt": [{"subitem_source_identifier": "0915-1923", "subitem_source_identifier_type": "ISSN"}]}, "item_7_source_id_14": {"attribute_name": "書誌レコードID", "attribute_value_mlt": [{"subitem_source_identifier": "AN1007132X", "subitem_source_identifier_type": "NCID"}]}, "item_7_version_type_20": {"attribute_name": "著者版フラグ", "attribute_value_mlt": [{"subitem_version_resource": "http://purl.org/coar/version/c_970fb48d4fbd8a85", "subitem_version_type": "VoR"}]}, "item_creator": {"attribute_name": "著者", "attribute_type": "creator", "attribute_value_mlt": [{"creatorNames": [{"creatorName": "南條, 浩輝"}], "nameIdentifiers": [{"nameIdentifier": "6199", "nameIdentifierScheme": "WEKO"}]}, {"creatorNames": [{"creatorName": "加藤, 一臣"}], "nameIdentifiers": [{"nameIdentifier": "6200", "nameIdentifierScheme": "WEKO"}]}, {"creatorNames": [{"creatorName": "李, 晃伸"}], "nameIdentifiers": [{"nameIdentifier": "6201", "nameIdentifierScheme": "WEKO"}]}, {"creatorNames": [{"creatorName": "河原, 達也"}], "nameIdentifiers": [{"nameIdentifier": "6202", "nameIdentifierScheme": "WEKO"}]}]}, "item_files": {"attribute_name": "ファイル情報", "attribute_type": "file", "attribute_value_mlt": [{"accessrole": "open_date", "date": [{"dateType": "Available", "dateValue": "2023-03-02"}], "displaytype": "detail", "download_preview_message": "", "file_order": 0, "filename": "IEICEDII_JD86II_4_450.pdf", "filesize": [{"value": "6.9 MB"}], "format": "application/pdf", "future_date_message": "", "is_thumbnail": false, "licensetype": "license_free", "mimetype": "application/pdf", "size": 6900000.0, "url": {"label": "IEICEDII_JD86II_4_450.pdf", "url": "https://naist.repo.nii.ac.jp/record/3797/files/IEICEDII_JD86II_4_450.pdf"}, "version_id": "478a740d-e53c-4a3b-9ca2-7f2bb8cbff87"}]}, "item_keyword": {"attribute_name": "キーワード", "attribute_value_mlt": [{"subitem_subject": "話し言葉", "subitem_subject_scheme": "Other"}, {"subitem_subject": "音声認識", "subitem_subject_scheme": "Other"}, {"subitem_subject": "音響モデル", "subitem_subject_scheme": "Other"}, {"subitem_subject": "言語モデル", "subitem_subject_scheme": "Other"}, {"subitem_subject": "逐次デコーダ", "subitem_subject_scheme": "Other"}]}, "item_language": {"attribute_name": "言語", "attribute_value_mlt": [{"subitem_language": "jpn"}]}, "item_resource_type": {"attribute_name": "資源タイプ", "attribute_value_mlt": [{"resourcetype": "journal article", "resourceuri": "http://purl.org/coar/resource_type/c_6501"}]}, "item_title": "大規模な日本語話し言葉データベースを用いた講演音声認識", "item_titles": {"attribute_name": "タイトル", "attribute_value_mlt": [{"subitem_title": "大規模な日本語話し言葉データベースを用いた講演音声認識"}]}, "item_type_id": "7", "owner": "2", "path": ["35"], "permalink_uri": "http://hdl.handle.net/10061/7789", "pubdate": {"attribute_name": "公開日", "attribute_value": "2012-07-05"}, "publish_date": "2012-07-05", "publish_status": "0", "recid": "3797", "relation": {}, "relation_version_is_last": true, "title": ["大規模な日本語話し言葉データベースを用いた講演音声認識"], "weko_shared_id": -1}

大規模な日本語話し言葉データベースを用いた講演音声認識

http://hdl.handle.net/10061/7789

名前 / ファイル	ライセンス	アクション
IEICEDII_JD86II_4_450.pdf (6.9 MB)

Item type

学術雑誌論文 / Journal Article(1)

公開日

2012-07-05

タイトル

大規模な日本語話し言葉データベースを用いた講演音声認識

その他のタイトル

Lecture Speech Recognition Using Large Corpus of Spontaneous Japanese

言語

jpn

キーワード

主題Scheme

Other

主題

話し言葉

キーワード

主題Scheme

Other

主題

音声認識

キーワード

主題Scheme

Other

主題

音響モデル

キーワード

主題Scheme

Other

主題

言語モデル

キーワード

主題Scheme

Other

主題

逐次デコーダ

資源タイプ

journal article

著者

南條, 浩輝
加藤, 一臣
李, 晃伸
河原, 達也

抄録

内容記述タイプ

Abstract

内容記述

開放的融合研究「話し言葉工学」プロジェクトにおいて構築されている日本語話し言葉コーパスを用いて講演音声の認識を行った.話し言葉は書き言葉の読上げ音声と大きく性質が異なるため,それに合致したモデル化と認識手法の検討が必要となる.音響モデルについては発話スタイルとデータ量の影響を調べた.言語モデルについては,話し言葉コーパスのデータ量不足を補うために他のコーパスと混合する方法,特に混合重みの最適化手法を提案する.また認識に際して,事前の発話のセグメンテーションが容易でないため,ショートポーズの自動認識に基づいて区分化と認識結果の確定を行う逐次デコーディング方式を提案・実装した.10名の話者による講演音声の認識実験で提案手法の有効性を示し,平均66.2%の認識率を得た.

書誌情報

電子情報通信学会論文誌D-II

巻 J86-D-II, 号 4, p. 450-459, 発行日 2003-04

出版者

電子情報通信学会

出版者URL

権利情報

http://ci.nii.ac.jp/naid/110003170907 | http://ci.nii.ac.jp/naid/110003170907

ISSN

収録物識別子タイプ

ISSN

収録物識別子

0915-1923

書誌レコードID

収録物識別子タイプ

NCID

収録物識別子

AN1007132X

権利

権利情報

著者版フラグ

出版タイプ

VoR

戻る

views

See details

	Views

Versions

Ver.1

2023-07-25 14:15:47.201723

Show All versions

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

大規模な日本語話し言葉データベースを用いた講演音声認識

× 南條, 浩輝

× 加藤, 一臣

× 李, 晃伸

× 河原, 達也

Versions

Share

Cite as

エクスポート