split ark files

310 views
Skip to first unread message

Ana Montalvo

unread,
Dec 5, 2016, 10:44:26 AM12/5/16
to kaldi-help
Hi all!
Is there any way to split ark files? My problem is that I have 4Gb ark files and I need to read the features in txt format, which make them even heavier. subset-feats only gives me the possibility of extract the n firsts feature-files, but I need to divide it in smaller ark files.
Any suggestions?
thx

Daniel Povey

unread,
Dec 5, 2016, 11:58:03 AM12/5/16
to kaldi-help
If you copy the ark files to ark,scp (or write them that way at the
start), you can use the scp files instead which are more convenient
for splitting; and then you can copy those scp files, after splitting,
to new ark files. Read "I/O from a command line persspective" in the
documentation.

Dan
> --
> You received this message because you are subscribed to the Google Groups
> "kaldi-help" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to kaldi-help+...@googlegroups.com.
> For more options, visit https://groups.google.com/d/optout.

Ana Montalvo

unread,
Dec 5, 2016, 2:08:40 PM12/5/16
to kaldi...@googlegroups.com
Thx for your answer Dan, but in orther to understand:
If I edit a .scp file, with the freatures I am interested in for example:

'mine.scp'
zccrk /feats_bnf_test.1.ark:6
zccrq /feats_bnf_test.1.ark:503691
zcdzz /feats_bnf_test.1.ark:1007376

Is it possible to create 'mine.ark' file having a big 'original.ark'
file containing the features of my interest and many more?
How could I do that?
thx a lot!!
> You received this message because you are subscribed to a topic in the
> Google Groups "kaldi-help" group.
> To unsubscribe from this topic, visit
> https://groups.google.com/d/topic/kaldi-help/Lxoobrjr01s/unsubscribe.
> To unsubscribe from this group and all its topics, send an email to

Daniel Povey

unread,
Dec 5, 2016, 2:10:09 PM12/5/16
to kaldi-help
You can do something like
copy-feats scp:mine.scp ark:min.ark
but the real solution is to read the documentation page and understand
how the framework works.

Ana Montalvo

unread,
Dec 6, 2016, 10:45:30 AM12/6/16
to kaldi...@googlegroups.com
Thank you very much Daniel, all the documentation but specially your
prompt answers to kaldi users list are a great incentive to continue
working with Kaldi and it makes me feell as part of your team. I
really appreciate it, regards
ana
Reply all
Reply to author
Forward
0 new messages