2017-02-14 84 views
0

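I want to convert 2 CAF files locally into a single file. Both CAF files are mono streams, and ideally I'd like them to end up as one stereo file, so that I have the microphone on one channel and the speaker on the other. How can I convert 2 mono files into a single stereo file on iOS?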

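I originally started with AVAssetTrack and AVMutableCompositionTracks, but I couldn't solve the mixing problem. My merged file was a single mono stream that interleaved the two files. So I went the AVAudioEngine route instead.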

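From my understanding, I can pass in my two files as input nodes, connect them to a mixer, and have an output node that receives the stereo mix. The output file has a stereo layout (I can open it in Audacity and see the two channels), but no audio data seems to be written to it. Placing a dispatch semaphore signal around the installTapOnBus call didn't help much either. CoreAudio has been a challenge to understand, so any insight would be appreciated.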

// obtain path of microphone and speaker files 
NSString *micPath = [[NSBundle mainBundle] pathForResource:@"microphone" ofType:@"caf"]; 
NSString *spkPath = [[NSBundle mainBundle] pathForResource:@"speaker" ofType:@"caf"]; 
NSURL *micURL = [NSURL fileURLWithPath:micPath]; 
NSURL *spkURL = [NSURL fileURLWithPath:spkPath]; 

// create engine 
AVAudioEngine *engine = [[AVAudioEngine alloc] init]; 

AVAudioFormat *stereoFormat = [[AVAudioFormat alloc] initStandardFormatWithSampleRate:16000 channels:2]; 

AVAudioMixerNode *mainMixer = engine.mainMixerNode; 

// create audio files 
AVAudioFile *audioFile1 = [[AVAudioFile alloc] initForReading:micURL error:nil]; 
AVAudioFile *audioFile2 = [[AVAudioFile alloc] initForReading:spkURL error:nil]; 

// create player input nodes 
AVAudioPlayerNode *apNode1 = [[AVAudioPlayerNode alloc] init]; 
AVAudioPlayerNode *apNode2 = [[AVAudioPlayerNode alloc] init]; 

// attach nodes to the engine 
[engine attachNode:apNode1]; 
[engine attachNode:apNode2]; 

// connect player nodes to engine's main mixer 
stereoFormat = [mainMixer outputFormatForBus:0]; 
[engine connect:apNode1 to:mainMixer fromBus:0 toBus:0 format:audioFile1.processingFormat]; 
[engine connect:apNode2 to:mainMixer fromBus:0 toBus:1 format:audioFile2.processingFormat]; 
[engine connect:mainMixer to:engine.outputNode format:stereoFormat]; 

// start the engine 
NSError *error = nil; 
if(![engine startAndReturnError:&error]){ 
    NSLog(@"Engine failed to start."); 
} 

// create output file 
NSString *mergedAudioFile = [[micPath stringByDeletingLastPathComponent] stringByAppendingPathComponent:@"merged.caf"]; 
[[NSFileManager defaultManager] removeItemAtPath:mergedAudioFile error:&error]; 
NSURL *mergedURL = [NSURL fileURLWithPath:mergedAudioFile]; 
AVAudioFile *outputFile = [[AVAudioFile alloc] initForWriting:mergedURL settings:[engine.inputNode inputFormatForBus:0].settings error:&error]; 

// write from buffer to output file 
[mainMixer installTapOnBus:0 bufferSize:4096 format:[mainMixer outputFormatForBus:0] block:^(AVAudioPCMBuffer *buffer, AVAudioTime *when){ 
    NSError *error; 
    BOOL success; 
    NSLog(@"Writing"); 
    if((outputFile.length < audioFile1.length) || (outputFile.length < audioFile2.length)){ 
     success = [outputFile writeFromBuffer:buffer error:&error]; 
     NSCAssert(success, @"error writing buffer data to file, %@", [error localizedDescription]); 
     if(error){ 
      NSLog(@"Error: %@", error); 
     } 
    } 
    else{ 
     [mainMixer removeTapOnBus:0]; 
     NSLog(@"Done writing"); 
    } 
}]; 

}

+0

Are you holding a strong reference to the AVAudioFile you're writing to? – dave234

+0

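@Dave, outputFile doesn't exist before the write. As for strong references, I set the audio file up to write to mergedURL, which is the fileURLWithPath of mergedAudioFile. No other objects or variables reference outputFile, and I don't destroy it after the installTapOnBus call. – A21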

+0

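One weakness of this approach is that you have to wait the full duration of the files while they are rendered into one. That said, if you stick with AVAudioEngine, you might try getting both files playing first. Then, once that step works, install the tap and write to the file. But if I were doing it myself, I would use the C API. – dave234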

Answers

2

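Doing this with ExtAudioFile involves three files and three buffers: two mono for reading and one stereo for writing. In a loop, each mono file reads a small segment of audio into its mono buffer, which is then copied into the correct "half" of the stereo buffer. Once the stereo buffer is full of data, that buffer is written to the output file. This repeats until both mono files have finished reading (writing zeros if one mono file is longer than the other).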
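The loop described above can be sketched in plain C, independent of Core Audio. This is only an illustration under assumptions: `read_frames` is a hypothetical stand-in for a file reader such as ExtAudioFileRead, the inputs are in-memory arrays rather than files, and `FRAMES_PER_READ` is kept tiny so the chunking is easy to follow.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define FRAMES_PER_READ 4  /* tiny on purpose; a real reader would use e.g. 4096 */

/* Hypothetical stand-in for a mono file reader: copies up to `want` frames
   from src starting at *pos and returns how many frames were copied. */
static size_t read_frames(const int16_t *src, size_t len, size_t *pos,
                          int16_t *dst, size_t want) {
    size_t remaining = len - *pos;
    size_t got = want < remaining ? want : remaining;
    memcpy(dst, src + *pos, got * sizeof(int16_t));
    *pos += got;
    return got;
}

/* Merge two mono streams into planar stereo, zero-padding the shorter one.
   out_left and out_right must each hold max(left_len, right_len) frames.
   Returns the number of stereo frames written. */
size_t merge_mono_to_stereo(const int16_t *left, size_t left_len,
                            const int16_t *right, size_t right_len,
                            int16_t *out_left, int16_t *out_right) {
    assert(left && right && out_left && out_right);
    int16_t lbuf[FRAMES_PER_READ], rbuf[FRAMES_PER_READ];
    size_t lpos = 0, rpos = 0, written = 0;
    for (;;) {
        size_t lgot = read_frames(left, left_len, &lpos, lbuf, FRAMES_PER_READ);
        size_t rgot = read_frames(right, right_len, &rpos, rbuf, FRAMES_PER_READ);
        size_t frames = lgot > rgot ? lgot : rgot;
        if (frames == 0) break;  /* both inputs are exhausted */
        /* Zero the part of each mono buffer that the read did not fill. */
        memset(lbuf + lgot, 0, (FRAMES_PER_READ - lgot) * sizeof(int16_t));
        memset(rbuf + rgot, 0, (FRAMES_PER_READ - rgot) * sizeof(int16_t));
        /* Copy each mono buffer into its "half" of the stereo output. */
        memcpy(out_left + written, lbuf, frames * sizeof(int16_t));
        memcpy(out_right + written, rbuf, frames * sizeof(int16_t));
        written += frames;
    }
    return written;
}
```

The zeroing happens after each read, so a short final read never leaves stale samples from the previous chunk in the tail of the buffer.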

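The most problematic area for me was getting the file formats right; Core Audio wants very specific formats. Luckily, AVAudioFormat exists to simplify the creation of some common formats.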

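Each audio file reader/writer has two formats: one that represents the format the data is stored in (file_format), and one that dictates the format going into or coming out of the reader/writer (client_format). If the two formats differ, a format converter built into the reader/writer translates between them.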
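To make the difference between a non-interleaved client format and an interleaved file format concrete, here is a rough C sketch of the kind of rearrangement such a converter performs for 16-bit stereo. The function name `interleave_stereo` is made up for illustration; it is not part of any Apple API, and the real converter handles far more cases than this.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Interleave two planar channels into L R L R ... order.
   `out` must have room for 2 * frames samples. */
void interleave_stereo(const int16_t *left, const int16_t *right,
                       size_t frames, int16_t *out) {
    assert(left && right && out);
    for (size_t i = 0; i < frames; i++) {
        out[2 * i]     = left[i];   /* even slots carry the left channel */
        out[2 * i + 1] = right[i];  /* odd slots carry the right channel */
    }
}
```

With planar data each channel lives in its own buffer, which is why the stereo client buffer list has two buffers of one channel each, while the stored file holds a single stream of alternating samples.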

Here's an example using ExtAudioFile:

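//Prototypes for the C helper functions defined after the method
AudioBufferList *createBufferList(int bufferSize, int numberBuffers);
void freeBufferList(AudioBufferList *bufferList);
NSString *documentsDir(void);

-(void)soTest{ 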


    //This is what format the readers will output 
    AVAudioFormat *monoClienFormat = [[AVAudioFormat alloc]initWithCommonFormat:AVAudioPCMFormatInt16 sampleRate:44100.0 channels:1 interleaved:0]; 

    //This is the format the writer will take as input 
    AVAudioFormat *stereoClientFormat = [[AVAudioFormat alloc]initWithCommonFormat:AVAudioPCMFormatInt16 sampleRate:44100 channels:2 interleaved:0]; 

    //This is the format that will be written to storage. It must be interleaved. 
    AVAudioFormat *stereoFileFormat = [[AVAudioFormat alloc]initWithCommonFormat:AVAudioPCMFormatInt16 sampleRate:44100 channels:2 interleaved:1]; 




    NSURL *leftURL = [NSBundle.mainBundle URLForResource:@"left" withExtension:@"wav"]; 
    NSURL *rightURL = [NSBundle.mainBundle URLForResource:@"right" withExtension:@"wav"]; 

    NSString *stereoPath = [documentsDir() stringByAppendingPathComponent:@"stereo.wav"]; 
    //stereoPath is a filesystem path, so it needs fileURLWithPath: rather than URLWithString:
    NSURL *stereoURL = [NSURL fileURLWithPath:stereoPath]; 

    ExtAudioFileRef leftReader; 
    ExtAudioFileRef rightReader; 
    ExtAudioFileRef stereoWriter; 


    OSStatus status = 0; 

    //Create readers and writer 
    status = ExtAudioFileOpenURL((__bridge CFURLRef)leftURL, &leftReader); 
    //All the ExtAudioFile functions return a non-zero status if there's an error. I'm only checking one to demonstrate, but you should be checking all of the ExtAudioFile function returns. 
    if(status)printf("error %i\n",(int)status); 
    ExtAudioFileOpenURL((__bridge CFURLRef)rightURL, &rightReader); 
    //Here the file format is set to stereo interleaved. 
    ExtAudioFileCreateWithURL((__bridge CFURLRef)stereoURL, kAudioFileCAFType, stereoFileFormat.streamDescription, nil, kAudioFileFlags_EraseFile, &stereoWriter); 


    //Set client format for readers and writer 
    ExtAudioFileSetProperty(leftReader, kExtAudioFileProperty_ClientDataFormat, sizeof(AudioStreamBasicDescription), monoClienFormat.streamDescription); 
    ExtAudioFileSetProperty(rightReader, kExtAudioFileProperty_ClientDataFormat, sizeof(AudioStreamBasicDescription), monoClienFormat.streamDescription); 
    ExtAudioFileSetProperty(stereoWriter, kExtAudioFileProperty_ClientDataFormat, sizeof(AudioStreamBasicDescription), stereoClientFormat.streamDescription); 


    int framesPerRead = 4096; 
    int bufferSize = framesPerRead * sizeof(SInt16); 

    //Allocate memory for the buffers 
    AudioBufferList *leftBuffer = createBufferList(bufferSize,1); 
    AudioBufferList *rightBuffer = createBufferList(bufferSize,1); 
    AudioBufferList *stereoBuffer = createBufferList(bufferSize,2); 

    //ExtAudioFileRead takes an ioNumberFrames argument. On input it's the number of frames you want, on output it's the number of frames you got. 0 means you're done. 
    UInt32 leftFramesIO = framesPerRead; 
    UInt32 rightFramesIO = framesPerRead; 



    while (leftFramesIO || rightFramesIO) { 
        if (leftFramesIO){ 
            //Request a full buffer and read into the left buffer 
            leftFramesIO = framesPerRead; 
            leftBuffer->mBuffers[0].mDataByteSize = bufferSize; 
            //Treat a read error as end of file so the loop cannot spin forever 
            if (ExtAudioFileRead(leftReader, &leftFramesIO, leftBuffer) != noErr) leftFramesIO = 0; 
        } 
        //Zero whatever the read did not fill (the whole buffer once the file is exhausted) 
        memset(((SInt16 *)leftBuffer->mBuffers[0].mData) + leftFramesIO, 0, sizeof(SInt16) * (framesPerRead - leftFramesIO)); 

        if (rightFramesIO){ 
            rightFramesIO = framesPerRead; 
            rightBuffer->mBuffers[0].mDataByteSize = bufferSize; 
            if (ExtAudioFileRead(rightReader, &rightFramesIO, rightBuffer) != noErr) rightFramesIO = 0; 
        } 
        memset(((SInt16 *)rightBuffer->mBuffers[0].mData) + rightFramesIO, 0, sizeof(SInt16) * (framesPerRead - rightFramesIO)); 

        UInt32 stereoFrames = MAX(leftFramesIO, rightFramesIO); 

        //Both readers are finished, so there is nothing left to write 
        if (stereoFrames == 0) break; 

        //copy left to stereoLeft and right to stereoRight 
        memcpy(stereoBuffer->mBuffers[0].mData, leftBuffer->mBuffers[0].mData, sizeof(SInt16) * stereoFrames); 
        memcpy(stereoBuffer->mBuffers[1].mData, rightBuffer->mBuffers[0].mData, sizeof(SInt16) * stereoFrames); 

        //write to file 
        stereoBuffer->mBuffers[0].mDataByteSize = stereoFrames * sizeof(SInt16); 
        stereoBuffer->mBuffers[1].mDataByteSize = stereoFrames * sizeof(SInt16); 
        ExtAudioFileWrite(stereoWriter, stereoFrames, stereoBuffer); 

    } 

    ExtAudioFileDispose(leftReader); 
    ExtAudioFileDispose(rightReader); 
    ExtAudioFileDispose(stereoWriter); 

    freeBufferList(leftBuffer); 
    freeBufferList(rightBuffer); 
    freeBufferList(stereoBuffer); 

} 

AudioBufferList *createBufferList(int bufferSize, int numberBuffers){ 
    assert(bufferSize > 0 && numberBuffers > 0); 
    //AudioBufferList already contains one AudioBuffer, so only add room for the rest 
    int bufferlistByteSize = sizeof(AudioBufferList); 
    bufferlistByteSize += sizeof(AudioBuffer) * (numberBuffers - 1); 
    AudioBufferList *bufferList = malloc(bufferlistByteSize); 
    bufferList->mNumberBuffers = numberBuffers; 
    for (int i = 0; i < numberBuffers; i++) { 
        bufferList->mBuffers[i].mNumberChannels = 1; 
        bufferList->mBuffers[i].mDataByteSize = bufferSize; 
        //calloc so the buffer starts out as silence 
        bufferList->mBuffers[i].mData = calloc(1, bufferSize); 
    } 
    return bufferList; 
} 
void freeBufferList(AudioBufferList *bufferList){ 
    for (int i = 0; i < bufferList->mNumberBuffers; i++) { 
     free(bufferList->mBuffers[i].mData); 
    } 
    free(bufferList); 
} 
NSString *documentsDir(void){ 
    //Look up the Documents directory once and cache it 
    static NSString *path = nil; 
    if(!path){ 
        path = NSSearchPathForDirectoriesInDomains(NSDocumentDirectory, NSUserDomainMask, YES).firstObject; 
    } 
    return path; 
} 
+0

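I'm getting back a stereo file with no output on either channel. The input mono files are CAF type, but I wouldn't expect the format to deviate too much. – A21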

+0

Are you checking all of the ExtAudioFile return values? – dave234

+0

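Yes, and I noticed that the problem is in the creation of the EAF output file. The URL I was passing in had the extension ".caf", compared to your ".wav". It gives me an OSStatus error of 1718449215, which refers to kAudioFormatUnsupportedDataFormatError. – A21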
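As a side note on reading status values like the one above: many Core Audio error codes are four-character codes packed into a 32-bit integer, so printing them as characters can be more informative than the decimal number. Below is a small C sketch of that decoding. The helper name `format_status` is made up for illustration, and a plain `int32_t` is used in place of OSStatus so the snippet compiles anywhere.

```c
#include <assert.h>
#include <ctype.h>
#include <stdint.h>
#include <stdio.h>

/* Writes a readable form of a status code into out (at least 16 bytes):
   the four-character code in quotes if all four bytes are printable,
   otherwise the plain decimal value. */
void format_status(int32_t status, char out[16]) {
    assert(out != NULL);
    uint32_t code = (uint32_t)status;
    unsigned char c[4] = {
        (unsigned char)(code >> 24), (unsigned char)(code >> 16),
        (unsigned char)(code >> 8),  (unsigned char)code
    };
    if (isprint(c[0]) && isprint(c[1]) && isprint(c[2]) && isprint(c[3])) {
        snprintf(out, 16, "'%c%c%c%c'", c[0], c[1], c[2], c[3]);
    } else {
        snprintf(out, 16, "%d", (int)status);
    }
}
```

Passing 1718449215 yields 'fmt?', the four-character code behind kAudioFormatUnsupportedDataFormatError, while a small negative code such as -50 is left as a decimal number.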
