A guide to gaining insights from contact center recordings and transcriptions with AI and automation

Alright everyone, this guide is mostly about me jumping on the OpenAI bandwagon. It seems to be getting full and I am frankly feeling a bit of FOMO.

To start, this guide came about thanks to the convergence of a few events:

- Requests for a simple, cost-effective (i.e., free) method to analyze call recordings, voicemails, and similar audio, and then automate based on insights from AI.
- UiPath’s preview release of a connector for OpenAI. I started out with the intent of just spending an hour playing around with it, but quickly found it to be a powerful (and addictive) tool combining AI and automation.
- And the oh-so-fun event of IT sending me a note that my PC may be compromised because it was running an older version of Python, and that I needed to upgrade it stat. Stick with me, the relevance of this will be clearer later.

If you are not interested in the technical bits and bytes, choose your own adventure and scroll to the end for the summary.

- Assumes you are familiar with UiPath and have UiPath Studio.
- Assumes Python is installed on the machine or VM where you will run OpenAI Whisper.
- Assumes you are familiar with PowerShell.
- This is for Windows. Mac and Linux options are available but not covered in this guide.
- This guide shows the art of the possible and is not “production worthy.” I have included ideas for production in the guide.
- This guide uses OpenAI’s Whisper for speech-to-text. This fulfills a common request for a “free” option. There are many speech-to-text tools that may be a better fit for your needs.

If you are going to have problems, it will likely be here, and I don’t want you to go through all the other steps if this doesn’t work.

Check that you have Python installed by checking the Python version number:

- Enter py --version into the PowerShell command prompt.
- If that doesn’t return a version number, you need to install Python or find a machine that does have Python. I am not covering Python installation in this guide because there are too many factors to consider and there’s a lot of content available on the topic.

Note: Recall above I mentioned that IT asked me to upgrade Python. I was already this far, so I thought I should give this Whisper tool a try.

Tip: I believe when you install and upgrade PIP it will also install Chocolatey. If the choco command doesn’t work, you may need to install Chocolatey manually.

- Enter whisper --model base followed by the path to your audio file.
  Example: whisper --model base "C:\Voicemails\Mar\Call0000001.m4a"

Tip: If you know the language of your recordings, add that as a parameter for faster processing.

Tip: Enter whisper --help at the prompt to see a list of all parameter options and languages supported.

Tip: Whisper will generate several files, including a text file of the transcribed audio. You might want to cd to a directory where you would like these files stored.

Tip: If you don’t have audio recordings handy, use Windows Voice Recorder to create and save a recording.

Another reminder that this is an art of the possible guide. You wouldn’t want to do some of what I have done below in a production implementation.

- Open UiPath Studio and start a new blank project.
- Drop a For Each File In Folder UiPath activity onto the sequence.
  Tip: You may need to add the dependency for this and other activities mentioned below via Manage Packages in UiPath Studio.
- Set In Folder to the folder where your audio recording files are saved.
- Set Filter By to *.m4a (or the file extension(s) used for your recordings).
- Add an Assign activity after Invoke Power Shell. The transcription is in a .txt file and we want to read that file, but we first need a variable that will point to it. I am simply taking the file name and replacing the extension with .txt.
- Create a variable named newString as type String.
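If it helps to see the logic outside of UiPath, here is a minimal Python sketch of three pieces of this guide: finding the recordings in a folder, building the Whisper command line, and replacing the audio file's extension with .txt to point at the transcript. This is not the UiPath workflow itself. The function names (find_recordings, whisper_command, transcript_path) and the example folders are my own placeholders, and the sketch assumes Whisper writes its text file, named after the audio file, into the directory where the command is run.

```python
from pathlib import Path
from typing import List, Optional


def find_recordings(folder: str, pattern: str = "*.m4a") -> List[Path]:
    """List the audio files in a folder, like For Each File In Folder with a filter."""
    return sorted(Path(folder).glob(pattern))


def whisper_command(audio_file: str, model: str = "base",
                    language: Optional[str] = None) -> List[str]:
    """Build the Whisper command line as a list of arguments (it is not run here)."""
    cmd = ["whisper", "--model", model]
    if language:
        # Optional: telling Whisper the language skips language detection.
        cmd += ["--language", language]
    cmd.append(str(audio_file))
    return cmd


def transcript_path(audio_file: str, output_dir: str) -> Path:
    """Swap the audio extension for .txt to point at the expected transcript.

    Assumption: Whisper was run from output_dir, so the .txt file is there.
    """
    return Path(output_dir) / Path(audio_file).with_suffix(".txt").name


if __name__ == "__main__":
    # Hypothetical folders; replace them with your own.
    for recording in find_recordings("recordings"):
        print(" ".join(whisper_command(str(recording), language="en")))
        print(transcript_path(str(recording), "transcripts"))
```

Building the command as a list rather than one string avoids quoting problems when a file path contains spaces, which is common with voicemail exports.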