Re: Trying to split on silence but losing content

Thanks again Jan, here goes.

Here is the link to the actual audio file:
http://www.spellingbeewinnars.org/whats-been-happening.mp3

It is in Afrikaans; I am studying the language.

Here is the full output:

sox -V $(ls | sort -f) conten.wav
sox:      SoX v14.4.2
sox INFO formats: detected file format type `wav'
sox INFO formats: detected file format type `wav'
sox INFO formats: detected file format type `wav'
sox INFO formats: detected file format type `wav'
sox INFO formats: detected file format type `wav'
sox INFO formats: detected file format type `wav'
sox INFO formats: detected file format type `wav'
sox INFO formats: detected file format type `wav'

Input File     : 'out001.wav'
Channels       : 2
Sample Rate    : 44100
Precision      : 16-bit
Duration       : 00:00:02.28 = 100495 samples = 170.91 CDDA sectors
File Size      : 402k
Bit Rate       : 1.41M
Sample Encoding: 16-bit Signed Integer PCM
Endian Type    : little
Reverse Nibbles: no
Reverse Bits   : no


Input File     : 'out002.wav'
Channels       : 2
Sample Rate    : 44100
Precision      : 16-bit
Duration       : 00:00:02.50 = 110225 samples = 187.457 CDDA sectors
File Size      : 441k
Bit Rate       : 1.41M
Sample Encoding: 16-bit Signed Integer PCM
Endian Type    : little
Reverse Nibbles: no
Reverse Bits   : no


Input File     : 'out003.wav'
Channels       : 2
Sample Rate    : 44100
Precision      : 16-bit
Duration       : 00:00:00.71 = 31431 samples = 53.4541 CDDA sectors
File Size      : 126k
Bit Rate       : 1.41M
Sample Encoding: 16-bit Signed Integer PCM
Endian Type    : little
Reverse Nibbles: no
Reverse Bits   : no


Input File     : 'out004.wav'
Channels       : 2
Sample Rate    : 44100
Precision      : 16-bit
Duration       : 00:00:02.84 = 125268 samples = 213.041 CDDA sectors
File Size      : 501k
Bit Rate       : 1.41M
Sample Encoding: 16-bit Signed Integer PCM
Endian Type    : little
Reverse Nibbles: no
Reverse Bits   : no


Input File     : 'out005.wav'
Channels       : 2
Sample Rate    : 44100
Precision      : 16-bit
Duration       : 00:00:00.50 = 22239 samples = 37.8214 CDDA sectors
File Size      : 89.0k
Bit Rate       : 1.41M
Sample Encoding: 16-bit Signed Integer PCM
Endian Type    : little
Reverse Nibbles: no
Reverse Bits   : no


Input File     : 'out006.wav'
Channels       : 2
Sample Rate    : 44100
Precision      : 16-bit
Duration       : 00:00:08.41 = 371047 samples = 631.032 CDDA sectors
File Size      : 1.48M
Bit Rate       : 1.41M
Sample Encoding: 16-bit Signed Integer PCM
Endian Type    : little
Reverse Nibbles: no
Reverse Bits   : no


Input File     : 'out007.wav'
Channels       : 2
Sample Rate    : 44100
Precision      : 16-bit
Duration       : 00:00:01.89 = 83369 samples = 141.784 CDDA sectors
File Size      : 334k
Bit Rate       : 1.41M
Sample Encoding: 16-bit Signed Integer PCM
Endian Type    : little
Reverse Nibbles: no
Reverse Bits   : no


Input File     : 'out008.wav'
Channels       : 2
Sample Rate    : 44100
Precision      : 16-bit
Duration       : 00:00:00.98 = 43117 samples = 73.3282 CDDA sectors
File Size      : 173k
Bit Rate       : 1.41M
Sample Encoding: 16-bit Signed Integer PCM
Endian Type    : little
Reverse Nibbles: no
Reverse Bits   : no


Output File    : 'conten.wav'
Channels       : 2
Sample Rate    : 44100
Precision      : 16-bit
Duration       : 00:00:20.12 = 887191 samples = 1508.83 CDDA sectors
Sample Encoding: 16-bit Signed Integer PCM
Endian Type    : little
Reverse Nibbles: no
Reverse Bits   : no
Comment        : 'Processed by SoX'

sox INFO sox: effects chain: input        44100Hz  2 channels
sox INFO sox: effects chain: output       44100Hz  2 channels
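
As a sanity check on the listing above, the sample counts of the eight input files can be added up and compared with the output file. This is only a minimal sketch: the counts are copied by hand from the "Duration" lines above, and 44100 is the sample rate reported for every file.

```shell
# Sample counts copied from the eight "Input File" sections above.
total=0
for n in 100495 110225 31431 125268 22239 371047 83369 43117; do
    total=$((total + n))
done
echo "total samples: $total"    # prints: total samples: 887191

# Convert to seconds at the 44100 Hz sample rate reported above.
seconds=$(awk -v t="$total" 'BEGIN { printf "%.2f", t / 44100 }')
echo "seconds: $seconds"        # prints: seconds: 20.12
```

The total matches the 887191 samples (00:00:20.12) reported for conten.wav, so the concatenation step itself does not seem to drop anything; whatever is missing appears to be gone already after the split.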


As for my intentions, I am just hoping to split an audio file into
short sentences. I get overwhelmed quite quickly with longer audio.


This is the session recorded with script(1):
:cat for-jan
Script started on 2021-11-01 13:11:41-0400
:sox ../whats-been-happening.mp3 out.wav silence   1 0.5 5%    1 0.5 5% : newfile : restart
:sox -V $(ls | sort -f) conten.wav
sox:      SoX v14.4.2
sox INFO formats: detected file format type `wav'
sox INFO formats: detected file format type `wav'
sox INFO formats: detected file format type `wav'
sox INFO formats: detected file format type `wav'
sox INFO formats: detected file format type `wav'
sox INFO formats: detected file format type `wav'
sox INFO formats: detected file format type `wav'
sox INFO formats: detected file format type `wav'
sox FAIL formats: can't determine type of file `for-jan'
:exit
exit

Script done on 2021-11-01 13:12:29-0400
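
The `sox FAIL` line in this session appears to come from `ls` also listing the typescript file for-jan (and, on a second run, it would list conten.wav as well). A minimal sketch that passes only the split pieces to sox, assuming they are named out001.wav, out002.wav, and so on as in the listing above:

```shell
# Collect only the split pieces. The shell expands the glob in sorted
# order, and it matches neither for-jan nor conten.wav.
set -- out*.wav

# If nothing matched, "$1" is still the literal pattern, so check that it exists.
if [ -e "$1" ] && command -v sox >/dev/null 2>&1; then
    sox -V "$@" conten.wav
else
    echo "no out*.wav pieces found, or sox is not installed" >&2
fi
```

This only tidies up the concatenation step; it does not change how the split itself behaves.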



Thanks

On 2021-11-01 12:43 p.m., Jan Stary wrote:
> On Nov 01 11:49:46, patrick@xxxxxxxxxxxxxxxxxxxxxx wrote:
>> sox ../whats-been-happening.mp3 out.wav silence   1 0.5 5%    1 0.5 5% :
>> newfile : restart
> Run that again with -V and show sox's response, in full.
>
>> and this was the command to concatenate:
>> sox -V $(ls | sort -f) conten.wav
>>
>> Here is the first printout, there were 8 of them and they were all similar.
>>
>> """"
>> Input File     : 'out001.wav'
>> Channels       : 2
>> Sample Rate    : 44100
>> Precision      : 16-bit
>> Duration       : 00:00:02.28 = 100495 samples = 170.91 CDDA sectors
> This one, for example, is 2.28 seconds long.
> Is that intended?
>
>> File Size      : 402k
>> Bit Rate       : 1.41M
>> Sample Encoding: 16-bit Signed Integer PCM
>> Endian Type    : little
>> Reverse Nibbles: no
>> Reverse Bits   : no
>> """
> You are not showing the Output File section,
> so we still know nothing. Post the response in full.
> Use the script(1) tool if you were unaware of it.
>
>> I was able to split audio files fine with Audacity and I
>> thought that it uses Sox in the background
> Audacity and SoX have nothing to do with each other.
>
>> and that I must have just
>> done something dumb but now I am not sure. The resulting concatenated
>> audio file from sox is a lot shorter than the files that were inputted.
> We still don't know what the files are and what SoX does with them.
> If it's not private, please post whats-been-happening.mp3 somewhere.
>
> 	Jan
>
>> On 2021-10-31 8:41 a.m., Jan Stary wrote:
>>> On Oct 30 15:21:52, patrick@xxxxxxxxxxxxxxxxxxxxxx wrote:
>>>> I would like to take 10 minutes of audio and split it on silence into
>>>> numerous audio files. I am then hoping to use a "for loop" to repeat
>>>> these split sections.
>>>>
>>>> If the audio was 001.wav 002.wav 003.wav, I want to concatenate ten
>>>> 001.wav files together then ten 002.wav files etc.
>>>>
>>>> I ran this command:
>>>>
>>>> sox audio.wav out.wav silence   1 0.5 5%    1 0.5 5% : newfile : restart
>>>>
>>>> It splits fine
>>> Meaning, you have out001.wav, out002.wav, etc?
>>> Did you actually play them? What does soxi out*wav say?
>>>
>>>> but if I concatenate the results back together to check
>>>> for lost sound, I get about 15 seconds of audio from a 10 minute audio file.
>>> What exactly is your "concatenating" command? (Please show sox -V)
>>>
>>>> Does anyone know why I am losing content? Is there a better set of
>>>> command arguments to do this?
>>> Hard to say if we don't know what you did and what sox said.
>>>
>>> 	Jan