# Corpws Lleferydd Paldaruo | Paldaruo Speech Corpus


Corpws lleferydd Cymraeg a gasglwyd trwy ddulliau torfoli yr ap Paldaruo 

(cod yr ap : https://github.com/techiaith/Paldaruo)  


*A Welsh language speech corpus crowdsourced via the Paldaruo app*

*(app source code : https://github.com/techiaith/Paldaruo)*




## Fersiwn 5 | Version 5

Cyhoeddwyd | *Published* : 19/12/2018

Ffurf Sain | *Audio Format* : audio/wav, 48kHz

Amser Sain | *Audio Duration* : 40 awr/hours

Nifer o ffeiliau | *Number of Files* : 14,215

Cyfranwyr | *Number of Contributors* : 564 



## Trwydded | License

Cydnabyddiaeth | *Attribution* : Cooper, S., Chan, D., Jones, D. B. (2017) The Paldaruo Speech Corpus, version 4 [http://techiaith.cymru/corpora/paldaruo/]

[![License: CC BY 4.0](https://img.shields.io/badge/License-CC%20BY%204.0-lightgrey.svg)](http://creativecommons.org/licenses/by/4.0/)


## Sut i lwytho'r corpws i lawr | How to download the corpus

Bydd rhaid i chi ddefnyddio git lfs (https://git-lfs.github.com/).

*You will have to use git lfs (https://git-lfs.github.com/).*

Defnyddiwch y gorchymyn canlynol :

*Use the following command :*

`git lfs clone --branch v5.0 --depth 1 https://git.techiaith.bangor.ac.uk/Data-Porth-Technolegau-Iaith/Corpws-Paldaruo.git`
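
Once the corpus has been cloned, the figures listed under Version 5 (file count, duration, 48kHz sample rate) can be checked locally. The following is a minimal sketch, not part of the corpus tooling. It assumes the audio files carry a `.wav` extension and are PCM WAV files readable by Python's standard `wave` module, and that the clone lives in a directory named `Corpws-Paldaruo` (the default directory name for the repository URL above). The function name `summarise_corpus` is hypothetical.

```python
import sys
import wave
from pathlib import Path


def summarise_corpus(root):
    """Return (file_count, total_seconds, sample_rates) for every .wav file under root."""
    file_count = 0
    total_seconds = 0.0
    sample_rates = set()
    for path in sorted(Path(root).rglob("*.wav")):
        # Assumption: each file is PCM WAV; wave.open raises wave.Error otherwise
        # (for example, if git lfs left an unfetched pointer file in place).
        with wave.open(str(path), "rb") as wav:
            rate = wav.getframerate()
            total_seconds += wav.getnframes() / rate
            sample_rates.add(rate)
        file_count += 1
    return file_count, total_seconds, sample_rates


if __name__ == "__main__":
    # Assumed default: the directory git creates when cloning Corpws-Paldaruo.git.
    root = sys.argv[1] if len(sys.argv) > 1 else "Corpws-Paldaruo"
    count, seconds, rates = summarise_corpus(root)
    print(f"files: {count}")
    print(f"duration: {seconds / 3600:.1f} hours")
    print(f"sample rates (Hz): {sorted(rates)}")
```

If the reported sample rates include anything other than 48000, or the script fails with `wave.Error`, the audio may not have been fetched through git lfs.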