πŸ’»How to turn audio into Vectors

Assumed Knowledge: Vectors Target Audience: Python developers, general developers Reading Time: 3 minutes

To help transform data into vectors, we open-sourced a library called VectorHub (you can explore the hub at hub.vctr.ai). For this, you will need to use Python, and you can run all of the below on Colab.

pip install vectorhub[encoders-audio-tfhub]

The following model reads in the audio file and then encodes it (turning it into a vector).

from vectorhub.encoders.audio.tfhub import Vggish2Vec
model = Vggish2Vec()
sample = model.read('https://vecsearch-bucket.s3.us-east-2.amazonaws.com/voices/common_voice_en_2.wav')
model.encode(sample)

From this - you will have obtained a vector which can now be indexed and stored away for search.

Last updated