whisper.cpp/examples/talk-llama/eleven-labs.py

import sys
import argparse
import textwrap

parser = argparse.ArgumentParser(add_help=False,
    formatter_class=argparse.RawTextHelpFormatter)
parser.add_argument("-q", "--quick", action="store_true",
    help="skip checking the required library")

modes = parser.add_argument_group("action")
modes.add_argument("inputfile", metavar="TEXTFILE",
    nargs='?', type=argparse.FileType(), default=sys.stdin,
    help="read the text file (default: stdin)")
modes.add_argument("-l", "--list", action="store_true",
    help="show the list of voices and exit")
modes.add_argument("-h", "--help", action="help",
    help="show this help and exit")

selopts = parser.add_argument_group("voice selection")
selmodes = selopts.add_mutually_exclusive_group()
selmodes.add_argument("-n", "--name",
    default="Arnold",
    help="get a voice object by name (default: Arnold)")
selmodes.add_argument("-v", "--voice", type=int, metavar="NUMBER",
    help="get a voice object by number (see --list)")
selopts.add_argument("-f", "--filter", action="append", metavar="KEY=VAL",
    default=["use case=narration"],
    help=textwrap.dedent('''\
        filter voices by labels (default: "use case=narration")
        this option can be used multiple times
        filtering will be disabled if the first -f has no "=" (e.g. -f "any")
        '''))

outmodes = parser.add_argument_group("output")
outgroup = outmodes.add_mutually_exclusive_group()
outgroup.add_argument("-s", "--save", metavar="FILE",
    default="audio.mp3",
    help="save the TTS to a file (default: audio.mp3)")
outgroup.add_argument("-p", "--play", action="store_true",
    help="play the TTS with ffplay")

args = parser.parse_args()

if not args.quick:
    import importlib.util
    if importlib.util.find_spec("elevenlabs") is None:
        print("elevenlabs library is not installed, you can install it to your enviroment using 'pip install elevenlabs'")
        sys.exit()

from elevenlabs import voices, generate, play, save

if args.filter and "=" in args.filter[0]:
    voicelist = voices()
    for f in args.filter:
        label, value = f.split("=")
        voicelist = filter(lambda x: x.labels.get(label) == value, voicelist)
    voicelist = list(voicelist)
else:
    voicelist = list(voices())

if args.list:
    for i, v in enumerate(voicelist):
        print(str(i) + ": " + v.name + " " + str(v.labels))
    sys.exit()

if args.voice:
    voice = voicelist[args.voice % len(voicelist)]
else:
    voice = args.name
    # if -n should consult -f, use the following
    #voice = next(x for x in voicelist if x.name == args.name)

audio = generate(
    text=str(args.inputfile.read()),
    voice=voice
)
if args.play:
    play(audio)
else:
    save(audio, args.save)
talk, talk-llama : add basic example script for eleven-labs tts (#728) 2023-04-14 18:53:58 +02:00			`import sys`
talk, talk-llama : pass text_to_speak as a file (#1865) * talk-llama: pass file instead of arg it is too hard to quote text in a portable way * talk-llama: pass heard_ok as a file * talk-llama: let eleven-labs.py accept options Options: -v voice, -s savefile, -p (--play) * talk-llama: check installed commands in "speak" Pass "-q" to eleven-labs.py to skip checking whether elevenlabs is installed * talk-llama: pass voice_id again in order to sync talk with talk-llama * talk: sync with talk-llama Passing text_to_speak as a file is safer and more portable cf. https://stackoverflow.com/a/59036879/45375 * talk and talk-llama: get all installed voices in speak.ps1 * talk and talk-llama: get voices from api * talk and talk-llama: add more options to eleven-labs.py and remove DEFAULT_VOICE because it is deprecated (https://www.reddit.com/r/ElevenLabs/comments/1830abt/what_happened_to_bella/) ``` usage: eleven-labs.py [-q] [-l] [-h] [-n NAME \| -v NUMBER] [-f KEY=VAL] [-s FILE \| -p] [TEXTFILE] options: -q, --quick skip checking the required library action: TEXTFILE read the text file (default: stdin) -l, --list show the list of voices and exit -h, --help show this help and exit voice selection: -n NAME, --name NAME get a voice object by name (default: Arnold) -v NUMBER, --voice NUMBER get a voice object by number (see --list) -f KEY=VAL, --filter KEY=VAL filter voices by labels (default: "use case=narration") this option can be used multiple times filtering will be disabled if the first -f has no "=" (e.g. -f "any") output: -s FILE, --save FILE save the TTS to a file (default: audio.mp3) -p, --play play the TTS with ffplay ``` * examples: add speak_with_file() as suggested in the review * talk and talk-llama: ignore to_speak.txt 2024-02-24 08:24:47 +01:00			`import argparse`
			`import textwrap`
talk, talk-llama : add basic example script for eleven-labs tts (#728) 2023-04-14 18:53:58 +02:00
talk, talk-llama : pass text_to_speak as a file (#1865) * talk-llama: pass file instead of arg it is too hard to quote text in a portable way * talk-llama: pass heard_ok as a file * talk-llama: let eleven-labs.py accept options Options: -v voice, -s savefile, -p (--play) * talk-llama: check installed commands in "speak" Pass "-q" to eleven-labs.py to skip checking whether elevenlabs is installed * talk-llama: pass voice_id again in order to sync talk with talk-llama * talk: sync with talk-llama Passing text_to_speak as a file is safer and more portable cf. https://stackoverflow.com/a/59036879/45375 * talk and talk-llama: get all installed voices in speak.ps1 * talk and talk-llama: get voices from api * talk and talk-llama: add more options to eleven-labs.py and remove DEFAULT_VOICE because it is deprecated (https://www.reddit.com/r/ElevenLabs/comments/1830abt/what_happened_to_bella/) ``` usage: eleven-labs.py [-q] [-l] [-h] [-n NAME \| -v NUMBER] [-f KEY=VAL] [-s FILE \| -p] [TEXTFILE] options: -q, --quick skip checking the required library action: TEXTFILE read the text file (default: stdin) -l, --list show the list of voices and exit -h, --help show this help and exit voice selection: -n NAME, --name NAME get a voice object by name (default: Arnold) -v NUMBER, --voice NUMBER get a voice object by number (see --list) -f KEY=VAL, --filter KEY=VAL filter voices by labels (default: "use case=narration") this option can be used multiple times filtering will be disabled if the first -f has no "=" (e.g. -f "any") output: -s FILE, --save FILE save the TTS to a file (default: audio.mp3) -p, --play play the TTS with ffplay ``` * examples: add speak_with_file() as suggested in the review * talk and talk-llama: ignore to_speak.txt 2024-02-24 08:24:47 +01:00			`parser = argparse.ArgumentParser(add_help=False,`
			`formatter_class=argparse.RawTextHelpFormatter)`
			`parser.add_argument("-q", "--quick", action="store_true",`
			`help="skip checking the required library")`

			`modes = parser.add_argument_group("action")`
			`modes.add_argument("inputfile", metavar="TEXTFILE",`
			`nargs='?', type=argparse.FileType(), default=sys.stdin,`
			`help="read the text file (default: stdin)")`
			`modes.add_argument("-l", "--list", action="store_true",`
			`help="show the list of voices and exit")`
			`modes.add_argument("-h", "--help", action="help",`
			`help="show this help and exit")`

			`selopts = parser.add_argument_group("voice selection")`
			`selmodes = selopts.add_mutually_exclusive_group()`
			`selmodes.add_argument("-n", "--name",`
			`default="Arnold",`
			`help="get a voice object by name (default: Arnold)")`
			`selmodes.add_argument("-v", "--voice", type=int, metavar="NUMBER",`
			`help="get a voice object by number (see --list)")`
			`selopts.add_argument("-f", "--filter", action="append", metavar="KEY=VAL",`
			`default=["use case=narration"],`
			`help=textwrap.dedent('''\`
			`filter voices by labels (default: "use case=narration")`
			`this option can be used multiple times`
			`filtering will be disabled if the first -f has no "=" (e.g. -f "any")`
			`'''))`

			`outmodes = parser.add_argument_group("output")`
			`outgroup = outmodes.add_mutually_exclusive_group()`
			`outgroup.add_argument("-s", "--save", metavar="FILE",`
			`default="audio.mp3",`
			`help="save the TTS to a file (default: audio.mp3)")`
			`outgroup.add_argument("-p", "--play", action="store_true",`
			`help="play the TTS with ffplay")`

			`args = parser.parse_args()`
talk, talk-llama : add basic example script for eleven-labs tts (#728) 2023-04-14 18:53:58 +02:00
talk, talk-llama : pass text_to_speak as a file (#1865) * talk-llama: pass file instead of arg it is too hard to quote text in a portable way * talk-llama: pass heard_ok as a file * talk-llama: let eleven-labs.py accept options Options: -v voice, -s savefile, -p (--play) * talk-llama: check installed commands in "speak" Pass "-q" to eleven-labs.py to skip checking whether elevenlabs is installed * talk-llama: pass voice_id again in order to sync talk with talk-llama * talk: sync with talk-llama Passing text_to_speak as a file is safer and more portable cf. https://stackoverflow.com/a/59036879/45375 * talk and talk-llama: get all installed voices in speak.ps1 * talk and talk-llama: get voices from api * talk and talk-llama: add more options to eleven-labs.py and remove DEFAULT_VOICE because it is deprecated (https://www.reddit.com/r/ElevenLabs/comments/1830abt/what_happened_to_bella/) ``` usage: eleven-labs.py [-q] [-l] [-h] [-n NAME \| -v NUMBER] [-f KEY=VAL] [-s FILE \| -p] [TEXTFILE] options: -q, --quick skip checking the required library action: TEXTFILE read the text file (default: stdin) -l, --list show the list of voices and exit -h, --help show this help and exit voice selection: -n NAME, --name NAME get a voice object by name (default: Arnold) -v NUMBER, --voice NUMBER get a voice object by number (see --list) -f KEY=VAL, --filter KEY=VAL filter voices by labels (default: "use case=narration") this option can be used multiple times filtering will be disabled if the first -f has no "=" (e.g. -f "any") output: -s FILE, --save FILE save the TTS to a file (default: audio.mp3) -p, --play play the TTS with ffplay ``` * examples: add speak_with_file() as suggested in the review * talk and talk-llama: ignore to_speak.txt 2024-02-24 08:24:47 +01:00			`if not args.quick:`
			`import importlib.util`
			`if importlib.util.find_spec("elevenlabs") is None:`
			`print("elevenlabs library is not installed, you can install it to your enviroment using 'pip install elevenlabs'")`
			`sys.exit()`
talk, talk-llama : add basic example script for eleven-labs tts (#728) 2023-04-14 18:53:58 +02:00
talk, talk-llama : pass text_to_speak as a file (#1865) * talk-llama: pass file instead of arg it is too hard to quote text in a portable way * talk-llama: pass heard_ok as a file * talk-llama: let eleven-labs.py accept options Options: -v voice, -s savefile, -p (--play) * talk-llama: check installed commands in "speak" Pass "-q" to eleven-labs.py to skip checking whether elevenlabs is installed * talk-llama: pass voice_id again in order to sync talk with talk-llama * talk: sync with talk-llama Passing text_to_speak as a file is safer and more portable cf. https://stackoverflow.com/a/59036879/45375 * talk and talk-llama: get all installed voices in speak.ps1 * talk and talk-llama: get voices from api * talk and talk-llama: add more options to eleven-labs.py and remove DEFAULT_VOICE because it is deprecated (https://www.reddit.com/r/ElevenLabs/comments/1830abt/what_happened_to_bella/) ``` usage: eleven-labs.py [-q] [-l] [-h] [-n NAME \| -v NUMBER] [-f KEY=VAL] [-s FILE \| -p] [TEXTFILE] options: -q, --quick skip checking the required library action: TEXTFILE read the text file (default: stdin) -l, --list show the list of voices and exit -h, --help show this help and exit voice selection: -n NAME, --name NAME get a voice object by name (default: Arnold) -v NUMBER, --voice NUMBER get a voice object by number (see --list) -f KEY=VAL, --filter KEY=VAL filter voices by labels (default: "use case=narration") this option can be used multiple times filtering will be disabled if the first -f has no "=" (e.g. -f "any") output: -s FILE, --save FILE save the TTS to a file (default: audio.mp3) -p, --play play the TTS with ffplay ``` * examples: add speak_with_file() as suggested in the review * talk and talk-llama: ignore to_speak.txt 2024-02-24 08:24:47 +01:00			`from elevenlabs import voices, generate, play, save`

			`if args.filter and "=" in args.filter[0]:`
			`voicelist = voices()`
			`for f in args.filter:`
			`label, value = f.split("=")`
			`voicelist = filter(lambda x: x.labels.get(label) == value, voicelist)`
			`voicelist = list(voicelist)`
			`else:`
			`voicelist = list(voices())`

			`if args.list:`
			`for i, v in enumerate(voicelist):`
			`print(str(i) + ": " + v.name + " " + str(v.labels))`
			`sys.exit()`

			`if args.voice:`
			`voice = voicelist[args.voice % len(voicelist)]`
			`else:`
			`voice = args.name`
			`# if -n should consult -f, use the following`
			`#voice = next(x for x in voicelist if x.name == args.name)`
talk, talk-llama : add basic example script for eleven-labs tts (#728) 2023-04-14 18:53:58 +02:00
examples : update elevenlabs scripts to use official python API (#837) * Update elevenlabs example to use ufficial python API * Update elevenlabs example to use official python API 2023-05-24 20:11:01 +02:00			`audio = generate(`
talk, talk-llama : pass text_to_speak as a file (#1865) * talk-llama: pass file instead of arg it is too hard to quote text in a portable way * talk-llama: pass heard_ok as a file * talk-llama: let eleven-labs.py accept options Options: -v voice, -s savefile, -p (--play) * talk-llama: check installed commands in "speak" Pass "-q" to eleven-labs.py to skip checking whether elevenlabs is installed * talk-llama: pass voice_id again in order to sync talk with talk-llama * talk: sync with talk-llama Passing text_to_speak as a file is safer and more portable cf. https://stackoverflow.com/a/59036879/45375 * talk and talk-llama: get all installed voices in speak.ps1 * talk and talk-llama: get voices from api * talk and talk-llama: add more options to eleven-labs.py and remove DEFAULT_VOICE because it is deprecated (https://www.reddit.com/r/ElevenLabs/comments/1830abt/what_happened_to_bella/) ``` usage: eleven-labs.py [-q] [-l] [-h] [-n NAME \| -v NUMBER] [-f KEY=VAL] [-s FILE \| -p] [TEXTFILE] options: -q, --quick skip checking the required library action: TEXTFILE read the text file (default: stdin) -l, --list show the list of voices and exit -h, --help show this help and exit voice selection: -n NAME, --name NAME get a voice object by name (default: Arnold) -v NUMBER, --voice NUMBER get a voice object by number (see --list) -f KEY=VAL, --filter KEY=VAL filter voices by labels (default: "use case=narration") this option can be used multiple times filtering will be disabled if the first -f has no "=" (e.g. -f "any") output: -s FILE, --save FILE save the TTS to a file (default: audio.mp3) -p, --play play the TTS with ffplay ``` * examples: add speak_with_file() as suggested in the review * talk and talk-llama: ignore to_speak.txt 2024-02-24 08:24:47 +01:00			`text=str(args.inputfile.read()),`
			`voice=voice`
examples : update elevenlabs scripts to use official python API (#837) * Update elevenlabs example to use ufficial python API * Update elevenlabs example to use official python API 2023-05-24 20:11:01 +02:00			`)`
talk, talk-llama : pass text_to_speak as a file (#1865) * talk-llama: pass file instead of arg it is too hard to quote text in a portable way * talk-llama: pass heard_ok as a file * talk-llama: let eleven-labs.py accept options Options: -v voice, -s savefile, -p (--play) * talk-llama: check installed commands in "speak" Pass "-q" to eleven-labs.py to skip checking whether elevenlabs is installed * talk-llama: pass voice_id again in order to sync talk with talk-llama * talk: sync with talk-llama Passing text_to_speak as a file is safer and more portable cf. https://stackoverflow.com/a/59036879/45375 * talk and talk-llama: get all installed voices in speak.ps1 * talk and talk-llama: get voices from api * talk and talk-llama: add more options to eleven-labs.py and remove DEFAULT_VOICE because it is deprecated (https://www.reddit.com/r/ElevenLabs/comments/1830abt/what_happened_to_bella/) ``` usage: eleven-labs.py [-q] [-l] [-h] [-n NAME \| -v NUMBER] [-f KEY=VAL] [-s FILE \| -p] [TEXTFILE] options: -q, --quick skip checking the required library action: TEXTFILE read the text file (default: stdin) -l, --list show the list of voices and exit -h, --help show this help and exit voice selection: -n NAME, --name NAME get a voice object by name (default: Arnold) -v NUMBER, --voice NUMBER get a voice object by number (see --list) -f KEY=VAL, --filter KEY=VAL filter voices by labels (default: "use case=narration") this option can be used multiple times filtering will be disabled if the first -f has no "=" (e.g. -f "any") output: -s FILE, --save FILE save the TTS to a file (default: audio.mp3) -p, --play play the TTS with ffplay ``` * examples: add speak_with_file() as suggested in the review * talk and talk-llama: ignore to_speak.txt 2024-02-24 08:24:47 +01:00			`if args.play:`
			`play(audio)`
			`else:`
			`save(audio, args.save)`