Ermine

Contact for Pricing

Twitter

Facebook

Copy Link

Transcribe audio from your device microphone using 100% local / client-side processing.

How Ermine can help you:

Ensures privacy and security by processing audio recordings locally.
Provides immediate transcription without relying on cloud services.
Facilitates easy recording and transcription directly from your device.

Why choose Ermine: Key features

100% local processing for enhanced privacy.
No need for internet connectivity for transcription.
User-friendly interface for straightforward operation.

Who should choose Ermine:

Journalists and researchers who handle sensitive information.
Students and professionals needing quick transcription services.
Anyone concerned with privacy and security of their data.

About Ermine

Website

https://www.ermine.ai

Release Date

March 2024

Pricing

Contact for Pricing

Related fields

Related News

aiOla unveils open source AI audio transcription model that obscures sensitive info in realtime

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More Businesses looking to use AI models to transcribe audio, specifically human speech, from executives, employees, and customers, may be wary of the idea of an AI program listening to and recording sensitive information. However, the Israeli audio AI startup aiOla has a new model that addresses this very concern. Built atop OpenAI's industry-standard open source model Whisper, the new Whisper-NER from aiOla is itself fully open source and available now on Hugging Face and Github for enterprises organizations, and individuals to take, use, adapt, modify and deploy. It integrates automatic speech recognition (ASR) with named entity recognition (NER). This innovation aims to enhance privacy by automatically identifying and masking sensitive information such as names, phone numbers, and addresses during the transcription process. A demo model is available for users to try on Hugging Face as well, allowing them to record snippets of speech and have the model mask specific words they type in, in the resulting typed transcript. The model performed successfully in my brief test of masking the word "VentureBeat" in my speech, which is a proper noun and jaron. Whisper-NER addresses a significant challenge in the transcription of spoken content: ensuring privacy and compliance with data protection regulations. The model processes audio files and simultaneously applies NER to tag or mask specific types of sensitive information directly within the transcription pipeline. Unlike traditional multi-step systems, which leave data exposed during intermediary processing stages, Whisper-NER eliminates the need for separate ASR and NER tools, reducing vulnerability to breaches. "We designed this as an open-source tool to advance privacy in AI," said Gill Hetz, Vice President of Research at aiOla, in a recent video call interview with VentureBeat. "It helps users mask sensitive data without needing additional software steps." Previously, aiOla was noted for releasing Whisper variants that could accurately and reliably recognize industry-specific jargon and transcribe it, as well as a much faster speech-to-text and speech recognition model. Fully Open Source for Community and Commercial Use Whisper-NER is fully open source and available under the MIT License, allowing users to adopt, modify, and deploy it freely, including for commercial applications. The model can be accessed on GitHub and Hugging Face, ensuring its advanced capabilities are broadly available. A demo is also provided to help users explore its functionality and adaptability. The open-source release aligns with aiOla's philosophy of fostering collaboration and innovation. "AI moves forward when people collaborate," Hetz said. "That's why we've made this model open source -- to encourage adoption and improvement by the community." Innovation in Speech and Data Privacy Built on OpenAI's Whisper framework, Whisper-NER was trained on a synthetic dataset combining synthetic speech and text-based NER datasets. This unique training approach allowed the model to handle transcription and entity recognition tasks simultaneously, offering superior accuracy. "Instead of separating ASR transcription and NLP [natural language processing] entity extraction, we solved both in one block," said Hetz. "When extracting text, the model simultaneously identifies specified entities." This integrated approach, described in a research paper published to the open access, non-peer reviewed site arXiv.org, not only simplifies workflows but also significantly enhances data security. Additionally, Whisper-NER supports zero-shot learning, enabling it to recognize and mask entity types that were not explicitly included during training. The flexibility of Whisper-NER makes it suitable for a variety of use cases, including compliance monitoring, inventory management, quality assurance, and more. For applications that do not require masking, the model can be configured to simply tag sensitive entities, providing organizations with customizable options to suit their needs. "Highly regulated industries like healthcare and law benefit most from our privacy-first approach, but even companies with limited sensitive data can use this technology," said Hetz. Ethical AI and Adaptability Whisper-NER represents a step forward in ethical AI development by enabling secure, privacy-focused transcription. Its open-source availability ensures that developers, researchers, and organizations can freely incorporate the model into their operations. By reducing risks associated with data breaches, it aligns with the growing demand for secure, AI-powered solutions in industries like healthcare, legal, and customer service. "This version, built on Whisper, is best for English but supports multiple languages. Open-source contributors can adapt it further for diverse languages and jargon," Hetz explained. aiOla encourages global contributions to extend the model's reach and functionality. With Whisper-NER now available to the public, aiOla reinforces its commitment to creating responsible AI tools that prioritize user privacy and security while fostering collaboration and innovation through open access.

VentureBeat

Wed, 20 Nov, 4:06 PM UTC

I write for a living -- and this AI transcription software is a true game changer

Transcribing interviews is the thing I dislike most about being a journalist. My hat goes off to stenographers because sitting there and typing out an interview is anything but fun. Thankfully, we now have AI tools to help make transcribing much easier. For a recent interview I conducted I turned to Otter.ai, which is built specifically for transcribing. I actually tried Google AI Studio first, but when I went to try and ask it for help transcribing I got a message stating it doesn't currently accept MP3 uploads. Ironically enough, Google AI Studio recommended I use Otter.ai. If Google suggested I use a competitor, then who was I to argue? So, what is it like using Otter.ai to transcribe interviews and how well does it perform the task? Read on to find out. Otter is an AI-based transcription program that uses voice recognition software to transcribe interviews, meetings, conversations, and other real-time events. Currently, Otter only translates English, Spanish, and French. Otter.ai offers free and paid versions. The Pro and Business options cost $17 or $30 a month, respectively. Yearly plans can save you up to 51%. There is also an Enterprise tier for organizations. The paid tiers are most useful for folks who frequently attend meetings throughout the week and require a streamlined way of keeping track of what was discussed. If you're like me and only need it for the occasional interview transcription, the free version is fine. However, you're limited to 30 minutes per conversation, 300 total minutes of overall use, and 3 transcriptions with the free option. Since I only needed a single interview transcribed, I signed up for the free version of Otter.ai by logging in with my Google account. Since I didn't know the process, I asked Otter.ai how I could upload an MP3 file for it to transcribe. This required me to go to the home page and import the file. If you're used to uploading images or other files, all of this is rather straightforward. The upload took about six minutes. When it finished, Otter.ai produced a rough transcription of the interview. By "rough," I mean that it looked like a simple text file denoting the speakers (in this case, myself and the interviewee)in the interview. However, after some time, Otter produced a cleaner version that appeared like a group text chat, with different speakers having their own color avatar bubbles. Otter.ai even produced a summary of the interview, which is very helpful for detailing the most important points. One interesting aspect of the interview I conducted was that it was with Japanese speakers. Since Otter.ai doesn't yet translate that language, the text it produced was only for what I asked and what the translator translated. However, the program did transcribe the bits of English the Japanese speakers said, which made things a little messier when looking through the transcript. As I mentioned above, the free version of Otter.ai only allows for 30 minutes per conversation. Thankfully, my interview was just shy of 30 minutes, so Otter.ai transcribed the entire conversation. Though the transcription isn't perfect, making minor errors throughout, I found the end result more than satisfying. I still have to edit the interview as I would one I had meticulously written myself. But I was spared the process of actually writing everything out, which is exactly what I wanted from this software. I'm a tech writer, so you might think I regularly use AI for work. While I dabble with the technology when reviewing the best laptops with AI capabilities like the Snapdragon X Elite-powered HP OmniBook X or Dell XPS 13, I don't regularly use AI. It's admittedly a point of pride that I produce work without AI assistance. However, my disdain for transcribing overrides my hesitation to use AI -- hence why I gave Otter.ai a shot to transcribe an interview. And I'm glad I did. Will I continue using Otter.ai to help me transcribe interviews and will I recommend it to others? Absolutely. Otter.ai will make my life much easier, which is exactly what AI has been promising. If you're like me and somewhat skeptical about AI, I suggest you give Otter.ai a shot if you need something transcribed. I promise you won't regret it.

Tom's Guide

Sat, 25 Jan, 2:12 PM UTC

I've Finally Found the Best Free Transcription Service

9 Free Open-Source Alternatives That Can Replace Your Paid Productivity Apps AI transcription platforms abound, but most are just okay, unless you're willing to pay for over-priced subscriptions with lackluster features. So when I stumbled on a solid tool that let me transcribe the interviews I conduct for my freelancing gigs, I couldn't believe it was free. Turboscribe Actually Is Simple and Free Trial and error led me through more than a few transcription services that would make quick work of my interviews -- a few on mobile and a few online, but everything had a catch. The UI's were weird, I had to pay per minute of recorded audio, or I had to subscribe to get any features that justified using that particular service. More googling led me to Turboscribe.ai, and that's where I found a simple user interface with a plain premise -- three free transcriptions per day, up to 30 minutes each. No gimmicks, no free trial period, no intense upselling. It works quickly, it's easy to use, and it's free forever. What Makes the App Particularly Great You get a lot of bang for your buck, and by that, I mean literally zero dollars. The free tier of Turboscribe offers exactly what I mentioned before (three free 30-minute transcriptions per day) with a few add-ons to boot. You can choose how fast you want your file to be transcribed by picking from one of the three options. Cheetah is the fastest option, but it's not going to be as accurate. Dolphin isn't as fast as Cheetah, but it's more accurate. Finally, Whale gets you the most accuracy, but you're gonna have to wait a minute. What I found is that you don't need to wait much longer than that. Even longer audio files (40-50 minutes) wrap a Whale transcription in under ten minutes, if not less than five. To date, I've only used Whale, and I've never felt like I'd waited too long. You can tell TurboScribe how many speakers are in the file, and then name the speakers once the transcription is finished, which is essential for deciphering and using the transcription later on down the road. Most impressively, you can translate foreign language audio directly to English, along with an option that will clean up noisy audio for you. If you want to pay for the Unlimited tier and unlock the service's full features, it's just $20 per month or $10 per month if paid yearly. This gets you unlimited uploads of up to ten-hour-long audio files and more export options if you want the transcription file in something other than PDF or TXT format. I still use the free tier of Turboscribe for quick transcription of audio interviews, and as a media professional, I can't recommend it enough.

MakeUseOf

Mon, 17 Mar, 12:12 PM UTC

MacWhisper 12 Introduces On-Device Speaker Recognition, Outperforming Apple Intelligence

MacWhisper 12, a leading AI transcription app, introduces on-device automatic speaker recognition, setting a new standard in AI-powered transcription technology and outperforming Apple's built-in intelligence features.

2 Sources

Wed, 19 Mar, 4:03 PM UTC

Best AI transcription app for the Mac comes to iPhone and iPad - 9to5Mac

One of the best Mac apps that we always recommend has made the jump to the iPhone and iPad. If you ever find yourself wanting to convert spoken audio into text, this AI-powered app is a must-have. MacWhisper is now available on the App Store for iPhone and iPad, bringing the core transcription experience to iOS. It took a bit longer than planned, but the first version of MacWhisper for iOS is now available! Quickly transcribe audio messages from apps such as iMessage, WhatsApp and Voice Memos, or transcribe files directly from the Files app. You can also make a new recording directly in the app. The iPhone and iPad app uses local AI models to transcribe audio for free. You can optionally use cloud models on iOS with a paid subscription to cover the server costs. MacWhisper uses AI to transcribe audio into text, summarize transcribed text, and break text up into segments for easily parsing. You can also use chat prompts to work with your transcribed text. While the 1.0 version is fairly straightforward, the MacWhisper team plans a number of upcoming features specific to the iPhone and iPad to improve the experience:

9to5Mac

Tue, 22 Apr, 4:20 PM UTC

Similar products

Whisper Memos

Whisper Memos transcribes your iOS voice memos and sends you an email with the transcription a few minutes later, utilizing OpenAI's Whisper technology.

Free Trial

ScriptMe

ScriptMe is a transcription and subtitling service that leverages artificial intelligence to convert audio and video content into text and create subtitles efficiently.

Free Trial

Voicepen

Quickly transcribe or convert audio and voice memos into blog posts; with the most powerful AI speech models.

Contact for Pricing

Plainscribe

PlainScribe is a digital transcription and translation service that converts audio and video files into searchable text.

Paid

TurboScribe

TurboScribe: Your ultimate AI-powered transcription tool that converts audio and video to text with unmatched speed and accuracy.

Free Trial

Don’t drown in AI news. We cut through the noise - filtering, ranking and summarizing the most important AI news, breakthroughs and research daily. Follow topics that matter to you and stay ahead.

Explore

News Categories

Technology Business Policy Startups Health Science Entertainment

Terms Privacy Content Contact Us