Skip to content
Projects
Groups
Snippets
Help
This project
Loading...
Sign in / Register
Toggle navigation
Formosa-Speech-in-the-Wild
Project
Overview
Details
Activity
Cycle Analytics
Repository
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Charts
Issues
5
Issues
5
List
Board
Labels
Milestones
Merge Requests
0
Merge Requests
0
CI / CD
CI / CD
Pipelines
Jobs
Schedules
Charts
Wiki
Wiki
Snippets
Snippets
Members
Members
Collapse sidebar
Close sidebar
Activity
Graph
Charts
Create a new issue
Jobs
Commits
Issue Boards
Open sidebar
Yuan-Fu Liao
Formosa-Speech-in-the-Wild
Commits
f5463a30
Commit
f5463a30
authored
Apr 14, 2018
by
Yuan-Fu Liao
Browse files
Options
Browse Files
Download
Email Patches
Plain Diff
Update README.md
parent
97fd06aa
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
27 additions
and
13 deletions
+27
-13
README.md
README.md
+27
-13
No files found.
README.md
View file @
f5463a30
...
...
@@ -2,23 +2,37 @@
##### Yuan-Fu Liao, Taipei University of Technology, [yfliao@mail.ntut.edu.tw](mailto:[yfliao@mail.ntut.edu.tw)
### 語料庫現況
簡介
這是整個TSW語料庫現況簡介的public project,若有關於整個TSW的
general
問題,歡迎在此發問(請用
[
issues
](
https://speech.nchc.org.tw/yfliao/Taiwanese-Speech-in-the-Wild/issues
)
)!
### 語料庫現況
這是整個TSW語料庫現況簡介的public project,若有關於整個TSW的問題,歡迎在此發問(請用
[
issues
](
https://speech.nchc.org.tw/yfliao/Taiwanese-Speech-in-the-Wild/issues
)
)!
若是針對個別子語料庫的問題請移駕到各子語料庫project網頁!
*
`如果有意願幫助校正語料,為語料庫盡一份心力,可以知會廖元甫(yfliao@mail.ntut.edu.tw),先協調工作分配,以免重複。`
### Bug Report
*
CA error
> git clone https://speech.nchc.org.tw/GrandChallenge/MATBN.git/ 時出現CA error
>
> fatal: unable to access 'https://speech.nchc.org.tw/GrandChallenge/MATBN.git/': server certificate verification failed. CAfile: /etc/ssl/certs/ca-certificates.crt CRLfile: none
workaround
> disable the ca-certificates verification by export GIT_SSL_NO_VERIFY=1
### 公告
#### 1.
The first wave of TSW corpora consists 5 subsets (beta version, except MATBN) and has been officially released on April 11, 2018!
*
The first wave of TSW corpora consists 5 subsets (beta version, except MATBN) and has been officially released on April 11, 2018!
|Corpus|abbreviation|Source|Hours|Remark|
|:---|:---|:---:|---:|:--|
|Mandarin Chinese Broadcast News corpus |MATBN|PTS|198.0|story and speaker boundaries|
|NER Phonetic Annotation corpus Vol. 1|NER-PhA-Vol1 |NER|6.5 | phone, syllable, speaker and code-switching|
|NER Manual Transcription corpus Vol. 1|NER-Trs-Vol1 |NER| 107.4 | manual, word sequences|
|NER Automatic Transcription corpus Vol. 1|NER-Auto-Vol1 |NER| 309.6 | auto, word sequences|
|PTS Manual Subtitlig corpus Vol. 1 |PTS-MSub-Vol1 |PTS| 264.0 | manual subtitling with time code|
|Total|||879.0| exclude NER-PhA-Vol1|
|Corpus|abbreviation|Source|Hours|Remark|
|:---|:---|:---:|---:|:--|
|Mandarin Chinese Broadcast News corpus |MATBN|PTS|198.0|story and speaker boundaries|
|NER Phonetic Annotation corpus Vol. 1|NER-PhA-Vol1 |NER|6.5 | phone, syllable, speaker and code-switching|
|NER Manual Transcription corpus Vol. 1|NER-Trs-Vol1 |NER| 107.4 | manual, word sequences|
|NER Automatic Transcription corpus Vol. 1|NER-Auto-Vol1 |NER| 309.6 | auto, word sequences|
|PTS Manual Subtitlig corpus Vol. 1 |PTS-MSub-Vol1 |PTS| 264.0 | manual subtitling with time code|
|Total|||879.0| exclude NER-PhA-Vol1|
*
PTS: Taiwan Public Television Service
*
NER: National Education Radio
*
PTS: Taiwan Public Television Service
*
NER: National Education Radio
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment