csound-xtract : Csound feature extraction using libXtract
Overview
csound-xtract is a set of plugin opcodes which use libXtract to perform feature extraction and associated tasks from within Csound.
Development is ongoing and tied to active research, so the code is provided in an experimental/alpha state and may contain bugs. Parts of the code are due for an overhaul and refactoring, but the intention is for the opcodes and their general operation to remain as presented here.
Requirements
- CMake >= 2.8.12
- Csound with development headers >= 6.14.0
- libXtract
Tested on Linux and Windows 7 with MSYS as of March 2021.
Installation
Create a build directory at the top of the source tree, then run cmake .., make and, optionally, make install as root. If make install is not used or not possible, the resulting libcsxtract library can instead be loaded with the --opcode-lib flag in Csound. The full build and install sequence is:
mkdir build && cd build
cmake ..
make && sudo make install
CMake should find Csound and libXtract using the modules in the cmake/Modules directory, so installation should be as simple as the commands above.
Examples
Some examples are provided in the examples directory.
Opcode reference
iprofile xtprofile [ibuffersize=4096, iblocksize=512, imfccs=1, icentroid=1, izerocrossings=1, irms=1, iflatness=1, iirregularity=1, ipower=1, isharpness=1, ismoothness=1]
Declare a feature extraction profile for use in extraction opcodes. The exact techniques for extraction of individual features can be found by examining the libXtract documentation and source code.
- iprofile : the profile handle
- ibuffersize : buffer size used in extraction
- iblocksize : block size for extraction
- imfccs : use MFCCs
- icentroid : use spectral centroid
- izerocrossings : use zero crossings
- irms : use RMS
- iflatness : use spectral flatness
- iirregularity : use spectral irregularity
- ipower : use spectral power
- isharpness : use spectral sharpness
- ismoothness : use spectral smoothness
icorpus xtcorpus iprofile, ifn
Analyse the sound contained in an f-table and store the analysis in a handle for later use in comparison/matching opcodes. The analysis is performed at init time.
- icorpus : the corpus handle
- iprofile : profile handle as created by xtprofile
- ifn : f-table containing the sound to be analysed, typically GEN1
ixtract, kdone xtractor iprofile, ain
Analyse a live sound.
- ixtract : the extraction handle
- iprofile : profile handle as created by xtprofile
- ain : sound to analyse
kdistance xtdistance ixtract1, ixtract2, ktrigger, [idistancefunc=0]
Compare two extraction streams using a basic distance function between each frame of the analyses. The profiles used for the streams must be the same.
- kdistance : the calculated distance; 0 should represent no difference between the analyses
- ixtract1 : analysis stream as created by xtractor
- ixtract2 : analysis stream as created by xtractor
- ktrigger : comparison is conducted when 1
- idistancefunc : 0 for Euclidean distance, 1 for Manhattan distance
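The two distance options can be illustrated outside Csound. The following is a minimal Python sketch of Euclidean and Manhattan distance between two feature frames of equal length. It is not the plugin's code: the function names are hypothetical, and any weighting or normalisation the opcode may apply to individual features is not covered.

```python
import math


def euclidean(a, b):
    """Square root of the summed squared differences between two frames."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def manhattan(a, b):
    """Sum of the absolute differences between two frames."""
    return sum(abs(x - y) for x, y in zip(a, b))


def frame_distance(a, b, distancefunc=0):
    """Hypothetical helper mirroring idistancefunc: 0 = Euclidean, 1 = Manhattan."""
    if len(a) != len(b):
        # Frames of different length imply different profiles.
        raise ValueError("frames must come from the same profile")
    return manhattan(a, b) if distancefunc == 1 else euclidean(a, b)


frame_a = [0.0, 3.0]
frame_b = [4.0, 0.0]
print(frame_distance(frame_a, frame_b, 0))  # 5.0
print(frame_distance(frame_a, frame_b, 1))  # 7.0
```

Identical frames give a distance of 0 under both functions, which matches the description of kdistance above.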
kanalysis[] xtdump ixtract
Obtain the analysis data from an xtractor handle. The array length depends on the features enabled in the profile used (MFCCs occupy 13 indexes, every other feature occupies 1). The indexes are presented in the same order as the features are declared in xtprofile. For example, a profile using MFCCs and centroid places the MFCCs at indexes 0 to 12 and the centroid at index 13. Similarly, a profile using only centroid and flatness places the centroid at index 0 and flatness at index 1.
- kanalysis[] : the analysed features
- ixtract : analysis stream as created by xtractor
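The index layout described above can be worked out mechanically. The following Python sketch is illustrative only: the names FEATURE_WIDTHS and index_layout are hypothetical and not part of the plugin. It assumes the feature order from the xtprofile signature and the stated widths (13 for MFCCs, 1 for everything else).

```python
# Feature order as declared in the xtprofile signature; MFCCs occupy 13 slots.
FEATURE_WIDTHS = [
    ("mfccs", 13), ("centroid", 1), ("zerocrossings", 1), ("rms", 1),
    ("flatness", 1), ("irregularity", 1), ("power", 1), ("sharpness", 1),
    ("smoothness", 1),
]


def index_layout(enabled):
    """Map each enabled feature to its (first, last) index in the array."""
    layout = {}
    position = 0
    for name, width in FEATURE_WIDTHS:
        if name in enabled:
            layout[name] = (position, position + width - 1)
            position += width
    return layout


print(index_layout({"mfccs", "centroid"}))
# {'mfccs': (0, 12), 'centroid': (13, 13)}
print(index_layout({"centroid", "flatness"}))
# {'centroid': (0, 0), 'flatness': (1, 1)}
```

The total array length for a profile is one more than the last index of its final enabled feature.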
kdone, kanalysis[] xtaccdump ixtract, ktrigger
Obtain the analysis data from an xtractor handle as with xtdump, but accumulate the analyses and output the mean when ktrigger is 1.
- kdone : outputs 1 when new data is provided, 0 at all other times
- kanalysis[] : the analysed features as with xtdump
- ixtract : analysis stream as created by xtractor
- ktrigger : triggers the output of data when 1
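The accumulate-then-average behaviour can be sketched in Python as follows. This is an illustration of the idea rather than the opcode's implementation: the class name is hypothetical, and resetting the accumulator after each output is an assumption, since the text above does not say whether the opcode does so.

```python
class MeanAccumulator:
    """Accumulates feature frames and returns their per-feature mean on demand."""

    def __init__(self):
        self.total = None
        self.count = 0

    def add(self, frame):
        """Add one analysis frame to the running per-feature sums."""
        if self.total is None:
            self.total = [0.0] * len(frame)
        for i, value in enumerate(frame):
            self.total[i] += value
        self.count += 1

    def flush(self):
        """Return the mean frame and reset (assumed); None if nothing was added."""
        if self.count == 0:
            return None
        mean = [t / self.count for t in self.total]
        self.total, self.count = None, 0
        return mean


acc = MeanAccumulator()
acc.add([1.0, 2.0])
acc.add([3.0, 6.0])
print(acc.flush())  # [2.0, 4.0]
```

In this sketch, calling flush corresponds to ktrigger being 1, and a non-None result corresponds to kdone being 1.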
kposition xtcorpusmatch icorpus, ixtract, ktrigger, [idistancefunc=0]
Obtain the nearest match in a corpus created with xtcorpus by comparing it against a live input stream created by xtractor.
- kposition : position of nearest corpus region, in sample points
- icorpus : corpus handle as created by xtcorpus
- ixtract : analysis stream as created by xtractor
- ktrigger : perform the comparison when 1
- idistancefunc : 0 for Euclidean distance, 1 for Manhattan distance
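Conceptually, corpus matching is a nearest-neighbour search over the analysed corpus frames. The Python sketch below illustrates that idea under explicit assumptions that are not confirmed by the text above: the corpus is treated as one feature frame per block of blocksize samples, and the returned position is the frame index multiplied by the block size. The function name and data layout are hypothetical.

```python
def nearest_position(corpus_frames, live_frame, blocksize=512, distancefunc=0):
    """Return the assumed sample position of the corpus frame closest to live_frame."""

    def distance(a, b):
        if distancefunc == 1:
            # Manhattan distance
            return sum(abs(x - y) for x, y in zip(a, b))
        # Euclidean distance
        return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5

    best_index = min(
        range(len(corpus_frames)),
        key=lambda i: distance(corpus_frames[i], live_frame),
    )
    # Assumption: frame i starts at sample i * blocksize.
    return best_index * blocksize


corpus = [[0.0, 0.0], [1.0, 1.0], [5.0, 5.0]]
print(nearest_position(corpus, [0.9, 1.2]))  # 512
print(nearest_position(corpus, [4.0, 4.0]))  # 1024
```

A position obtained this way could then be used as a read offset into the f-table that was analysed, which is how kposition is described above.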