I Stole my Friend's Voice With Ai

Corridor Crew
6 Mar 202219:59

TLDRIn a fascinating exploration of AI's potential and ethical boundaries, the video documents the process of creating a synthetic voice for a friend, Jake, without his initial consent. The creator, Sam, utilizes various AI voice platforms, ultimately choosing Dscript's Lyrebird algorithm to generate a voice that convincingly mimics Jake's. Despite the high quality of the AI voice, ethical concerns arise as Sam acknowledges the need for consent when using someone's voice. In a twist, Sam 'manufactures' Jake's consent by piecing together words from their podcast to form a fake agreement. The video humorously addresses the implications of AI technology while raising questions about privacy, consent, and the potential for misuse. It concludes with Jake giving his consent after hearing the AI voice, highlighting the fine line between innovation and intrusion.

Takeaways

  • 😀 The project explores using AI to recreate an individual's voice, initially without their consent, focusing on Jake as the subject.
  • 🤖 AI voice technology has advanced to a point where voices can be recreated with remarkable accuracy, using services like Descript which acquired Lyrebird.
  • 🎤 Consent is emphasized as crucial in the process of voice cloning, requiring verbal confirmation from the person whose voice is being used.
  • 👨‍💻 The ethical dilemma of using AI for voice cloning without consent is a central theme, questioning the morality versus the technological capability.
  • 🔧 Technical challenges involve gathering enough of Jake's voice samples to convincingly replicate his speech for AI purposes.
  • 🤔 Jake's initial unawareness of the project highlights concerns about privacy and the right to one's own voice and likeness.
  • 📜 A faux consent was engineered using pieced-together audio clips to meet service requirements, showcasing both technical ingenuity and ethical breaches.
  • 😅 The project's reveal to Jake leads to a discussion on the importance of consent and the potential personal and professional implications of AI voice cloning.
  • 🌟 The AI voice is ultimately used to create a positive team-building message, attempting to showcase a beneficial use of the technology.
  • ✅ After some persuasion, Jake gives his post-hoc consent for the use of his AI-cloned voice, reflecting a nuanced resolution to the ethical issues presented.

Q & A

  • What is the main topic of the video?

    -The main topic of the video is the creation of an AI voice using deep fakes technology without the subject's consent, and the ethical implications of such actions.

  • Who is the subject of the AI voice recreation?

    -The subject of the AI voice recreation is Jake, a character in the video who is unaware of the process.

  • What is the purpose of using AI to recreate someone's voice?

    -The purpose, as mentioned in the video, is to alleviate Jake's workload by automating tasks that require his vocal presence.

  • What are the ethical concerns raised in the video regarding the use of AI to recreate voices?

    -The ethical concerns include the lack of consent from the subject, potential misuse of the voice for malicious purposes, and the violation of privacy and personal rights.

  • How does the video creator attempt to gain Jake's consent after the fact?

    -The creator shows Jake the final product and its positive effects on others, hoping that Jake would feel he would have agreed to it had he known.

  • What is the role of Jake in the Corridor Crew?

    -Jake is in charge of managing everyone's schedules and workloads, studio meetings, presentations, and he also handles almost all the brand integrations for every video.

  • What is the significance of the phrase 'easier to ask for forgiveness than permission' in the context of the video?

    -The phrase is used to illustrate the creator's approach to using Jake's voice without consent, suggesting that they will seek forgiveness after the fact rather than obtaining permission beforehand.

  • How does the video address the issue of consent for using someone's voice in AI?

    -The video addresses the issue by showing the process of obtaining consent through a voice recording that verifies the person's identity and agreement to the use of their voice.

  • What is the final outcome of the video regarding Jake's voice and consent?

    -The final outcome is that Jake gives his consent for the use of his AI voice in the video after seeing its potential benefits.

  • What is the potential application of Jake's AI voice suggested in the video?

    -The potential application suggested is for team building presentations, inspirational messages, and possibly for branded segments in the future.

  • How does the video end?

    -The video ends with Jake giving his consent for the use of his AI voice in the video, and the creator promising to ask for consent every time before using it in the future.

Outlines

00:00

🤖 AI Voice Creation and Consent Issues

The first paragraph introduces the idea of using AI to recreate someone's voice, specifically Jake's, without his knowledge. It discusses the ethical concerns and technical challenges of voice synthesis, including obtaining consent. The speaker explores various AI voice services and their limitations, eventually using a company called Dscript to create a convincing AI voice. The process involves recording a voice dataset and using it to train an AI model. The speaker also humorously includes a song dedicated to friends, showcasing the AI's capabilities.

05:02

🚫 Ethical Dilemmas in AI Voice Synthesis

The second paragraph delves into the ethical implications of creating an AI voice without the subject's consent. It highlights the need for consent when submitting a voice set to transcription services and the speaker's decision to manufacture consent by piecing together words to form a fake consent statement. The paragraph also touches on the potential misuse of AI technology and the speaker's belief that Jake would eventually agree to the use of his AI voice after seeing the positive impact.

10:04

🎯 Utilizing AI for Team Building and Motivation

The third paragraph outlines a plan to use Jake's AI voice for an inspirational team-building presentation. The speaker uses GPT-3 to create a script that reflects Jake's personality and beliefs, including his Texan roots and business philosophies. The AI voice is used to deliver a motivational message about the importance of team players and the role of culture in business success. The goal is to demonstrate the AI voice's potential to Jake and gain his consent after the fact.

15:04

📝 Addressing Consent and the Future of AI Voice Usage

The fourth and final paragraph describes the aftermath of revealing the AI voice to Jake. The speaker admits to creating the AI voice without Jake's permission and discusses the legal and ethical issues surrounding the use of someone's voice without consent. Jake is asked for his consent to use the AI voice in the video, which he eventually grants. The paragraph concludes with a discussion about the potential future uses of AI voice technology and a call to action for viewers to explore more AI-related content on their website.

Mindmap

Keywords

💡AI

Artificial Intelligence (AI) refers to the simulation of human intelligence in machines that are programmed to think like humans and mimic their actions. In the video, AI is used to recreate a friend's voice, which is a central theme as it raises ethical questions about consent and privacy.

💡Deep Fakes

Deep fakes are synthetic media in which a person's likeness and voice are replaced with someone else's using AI. The video mentions the use of deep fakes to recreate someone's likeness, which is a significant part of the narrative as it leads to the creation of an AI voice without the subject's initial consent.

💡Lyrebird

Lyrebird is an algorithm that was purchased by a company called D-Script and integrated into their transcribing software. It is used in the video to create a realistic AI voice, which is a key technological tool in the process of voice replication without the subject's initial permission.

💡Consent

Consent in this context refers to the permission given by an individual to use their voice or likeness for a particular purpose. The video script revolves around the ethical dilemma of creating an AI voice without prior consent and the subsequent need to obtain it retroactively.

💡Terms of Service

Terms of Service are the legal agreements between a service provider and its users. In the video, the mention of Terms of Service is related to the process of consent where users agree to let a service use their voice, which becomes a point of contention when consent is manufactured without the subject's knowledge.

💡Tongue Twisters

Tongue twisters are phrases that are designed to be difficult to articulate properly. In the video, they are used as a test for the AI voice's ability to reproduce complex speech patterns, showcasing the advanced capabilities of the AI in mimicking human speech.

💡Inspirational Quotes

Inspirational quotes are sayings that aim to motivate or encourage individuals. In the video, the AI voice is used to deliver such quotes as part of a team-building presentation, demonstrating the potential positive uses of the technology.

💡Team Building

Team building refers to activities that are designed to improve relationships and collaboration within a team. The video presents the idea of using the AI voice for a team-building presentation, which is meant to inspire and motivate the team.

💡Brand Integrations

Brand integrations are marketing strategies where a brand is incorporated into various forms of media content. In the video, the concept is mentioned in the context of using the AI voice for automated brand promotion, highlighting the potential commercial applications of AI voice technology.

💡Public Personalities

Public personalities are individuals who are widely known or recognized by the public. The video discusses the ethical considerations of using public personalities' likenesses and voices in AI without their permission, touching on legal and privacy issues.

💡Non-Consensual

Non-consensual refers to actions that are taken without the agreement or permission of the person involved. The video's main conflict revolves around the creation of an AI voice without the subject's consent, leading to a discussion on the morality and legality of such actions.

Highlights

Sam used AI to recreate his friend Jake's voice without his consent, raising ethical concerns about AI voice synthesis.

The process involved experimenting with various AI voice services like Replica Studios and Resemble.ai.

Dscript's integration of the Lyrebird algorithm was used to create a more natural-sounding AI voice.

To train the AI, Sam recorded himself for 15 minutes, reading a D&D book to create a voice dataset.

The AI voice required a verbal consent, which Sam manipulated by piecing together words from existing podcast recordings.

Sam expressed the potential of AI voices for automating tasks and reducing workload, specifically for Jake.

Jake's initial reluctance to use AI and concerns about privacy and consent were highlighted.

The video demonstrated the creation of an AI voice that could potentially replace Jake in certain tasks.

An AI-generated song dedicated to friends Peter and Ren showcased the versatility of the synthesized voice.

Sam discussed the pressure on Jake and proposed using AI to alleviate his workload.

The video explored the concept of 'manufacturing consent' by piecing together words to mimic agreement.

The ethical dilemma of creating an AI voice without the person's permission was a central theme of the video.

The team used GPT-3 to generate a script for an inspirational team-building presentation using Jake's AI voice.

The presentation aimed to show the positive impact of the AI voice on team morale and motivation.

Jake eventually gave his consent for the video after seeing the potential benefits of the AI voice.

The video concluded with a discussion on the future use of AI voices with proper consent.

Sam emphasized the importance of ethical considerations when using AI to replicate human voices.