May 2002
- Random Resets John Robert Arras
- Random Resets Kwon Ekstrom
- Random Resets John Robert Arras
- [MLP] The use of ecology models Rayzam
- Interesting things to do (was: Player Accounts on a Non-Commercial MUD) John Buehler
- Apple WWDC? amanda@alfar.com
- Apple WWDC? Nathan F. Yospe
- Apple WWDC? Amanda Walker
- Apple WWDC? Sasha Hart
- Apple WWDC? amanda@alfar.com
- Apple WWDC? Brian Hook
- Apple WWDC? John A. Bertoglio
- Apple WWDC? Daniel.Harman@barclayscapital.com
- Apple WWDC? Matt Mihaly
- Apple WWDC? shren
- Apple WWDC? Matt Mihaly
- Apple WWDC? John A. Bertoglio
- Apple WWDC? James Edward Gray II
- Apple WWDC? Brian Hook
- Encouraging groups without grouping Brian 'Psychochild' Green
- Introduction and My solution to Powergamers, Treadmills, and Content Devaluation. Ken Raisor
- The audience is the medium. For now. John Szeder
- The audience is the medium. For now. John Buehler
- The audience is the medium. For now. Damion Schubert
- The audience is the medium. For now. Ted L. Chen
- The audience is the medium. For now. Shane Gough
- The audience is the medium. For now. Michael Tresca
- The audience is the medium. For now. Ted L. Chen
- The audience is the medium. For now. Marian Griffith
- The audience is the medium. For now. Vincent Archer
- The audience is the medium. For now. Damion Schubert
- The audience is the medium. For now. David B. Held
- The audience is the medium. For now. Ted L. Chen
- The audience is the medium. For now. Marian Griffith
- The audience is the medium. For now. F. Randall Farmer
- The audience is the medium. For now. J C Lawrence
- The audience is the medium. For now. Frank Crowell
- Ownership of characters Jasper McChesney
- Explorers? (Was: Codename Blue & Facets - Nick Yee's new studies) Brian 'Psychochild' Green
- Befriending Critters (was: Random Resets) Arthaey
- Questions about server design Ben Chambers
- Questions about server design Michael Bayne
- Questions about server design Sean Middleditch
- Questions about server design Mike Shaver
- Questions about server design Kwon Ekstrom
- Questions about server design szii@sziisoft.com
- Questions about server design F. Randall Farmer
- Questions about server design James Edward Gray II
- Questions about server design Shane Gough
- Extensibility Ben Chambers
- Extensibility Ammon Lauritzen
- Extensibility Kwon Ekstrom
- Extensibility Ben Chambers
- Extensibility "Christopher {siege} " OBrien
- Extensibility Sean Middleditch
- Extensibility Sean Kelly
- Extensibility John Buehler
- Extensibility shren
- Extensibility John Buehler
- Extensibility szii@sziisoft.com
- Extensibility John Buehler
- Extensibility szii@sziisoft.com
- Extensibility Mike Shaver
- Question: Any published research on Sims type game personae? susan wu
- [MLP] Why care about levels? (was: The use of ecolo Richard Woolcock
- [MLP] Why care about levels? (was: The use of ecolo gy models) Jon Lambert
- Component Design (was: Extensibility) Scion Altera
- [BIZ] Selling Stock (formerly Blacksnow revisted ) Robert A. Rice, Jr.
- Games are Hot. Period Michael Tresca
- Games are Hot. Period Gladimir
- Games are Hot. Period Richard Aihoshi aka Jonric
- Games are Hot. Period Freeman, Jeff
- Games are Hot. Period Richard Aihoshi aka Jonric
- Games are Hot. Period Koster, Raph
- Games are Hot. Period Vincent Archer
- Games are Hot. Period Richard Aihoshi aka Jonric
- Games are Hot. Period Matt Mihaly
- fun Matt Mihaly
- [TECH] Shortest Path William Murdick
- [TECH] Voice in MO* - Phoneme Decomposition and Reconstruction Ted L. Chen
- [TECH] Voice in MO* - Phoneme Decomposition and Reconstruction John Buehler
- [TECH] Voice in MO* - Phoneme Decomposition and Reconstruction Ted L. Chen
John Buehler writes:
> In response to Ted L. Chen:
> I've thought about it and I'm sure a number of others have as
> well. Personally, I consider the problems of STT and TTS to be a
> black box issue that others are tackling. What I want from those
> two things really boils down to the following:
> 1. The ability to capture continuously-spoken language or
> conventionally-written text into a compact form.
> 2. The ability to convert that compact form into either
> continuously-spoken language or conventionally-written text.
> In the case of the language or text, inflection/tonality/whatever
> should be part of what the compact form can represent.
> As an example, if I type "How are YOU today?", or I type "How are
> you today?!?", the compact form should be storing two somewhat
> different representations, just as if I say the questions
> differently. And the output of each should be representative of
> what was typed/stated, regardless of whether it is presented as
> text or speech. Text is obviously capable of a smaller spectrum
> of inflection and such, but what it is capable of should be
> retained.
I didn't think about that. Hmm... this begs the question of how
much onus we put on the player to encode their own text.
In one way, we can attempt to use heuristics to infer that "YOU" is
spoken with an emphasis instead of saying it as "you" or "Y.O.U.".
I think that's rather difficult because it requires that YOU be
placed into the context of the sentence. Just the fact that it is
capitalized doesn't mean it should get inflection, otherwise IBM
would sound really weird :)
This draws a close analogy to the problems in hand-writing
recognition. Taking a cue from the PalmOS graffiti, perhaps users
can be expected to utilize standardized tags. "How are *YOU*
today?" Some people already type it like this for emphasis. Other
encoding flags such as "-" or "..." could be used to denote pauses
as well. In essence, these tags need not be the same as the tags
used internally by the TTS. More likely, they would be meta-tags
which encompass a way of speaking, rather than the mechanics (i.e.
volumn, pitch) of the phonemes.
As a slight tangent, the text input can also expand oft-used
acronyms such as rotfl or lol.
Note that the default TTS would tend to have some built-in
heuristics that seem to be common. For instance, the L&H TTS engine
currently puts your standard raised inflection when it encounters a
question mark at the end of an input stream.
> The goal here is to have players both typing and speaking to the
> program, with the information efficiently conveyed to those who
> should receive it, to be output as written text or spoken word as
> desired by the receiver.
Ah, that's the rub. At least with the phoneme method. The
difficulty that most STT encounter is in that final stretch where
you determine what string of phonemes can constitute a word - or
more precisely, which word. What I'm prescribing is more like STP
(speech to phoneme). So, at least in the near future, where
processing capability is still growing, we may need to restrict the
output to speech only. Like the old days of TV before close
captioning became available.
That is of course, if players would forgo that option in exchange
for speech capability.
>> With a TTS, it is quite possible to expressively generate
>> synthesized speech but it currently requires hand coding a lot of
>> tags into the stream and at the phoneme level.
> And as such, would fail the 'conventionally-written' text
> requirement. Existing expressiveness in written text should be
> relied upon. Typing is only going to be used by those who are
> unable to speak, due to physical impediment or due to conditions
> such as not wanting to annoy those around you who are not playing
> the game. In any case, we don't want to make conversational input
> slower than it is today.
Had the player been required to encode all the inflections into the
text stream, then yes, it would fail that requirement. However,
this is where the default heuristics in the TTS engine would kick in
- which does a decent job. So for most sentences, it's able to
automatically add the required correct inflections. It's just for
the special cases such that you outlined above where the current
heuristics fail.
As for the expressive quality of the standard TTS, it will sound
rather bland or dead pan after a while because everyone is talking
exactly the same way. If everyone used text as the primary method
of input, it might seem like we walked into a bad voice actor's
convention. That's why I made the comment about encoding more tags.
It's not required for basic communication on the order of what we
currently have with text, but it does help in breaking up the
monotony. And hence the suggestion that be included in the
speech->phoneme decomposition.
> I believe that both original speech and manufactured speech are
> needed. Original speech transport is needed when players are
> speaking to players (telephone). Manufactured speech is needed
> when characters are speaking to characters (acting). I want both
> in the same game so that I can have a clear separation of in-game
> and out-of-game conversations available to players. If I want to
> talk about baseball, I can do it via my own voice. If I want to
> have my character discuss the balance of its weapon, I can do it
> via my character's voice. Note that my own voice can be sent to
> any player in the game willing to receive it, while my character's
> voice is limited to how far it carries in the game environment.
> I would be content with current primitive STT and TTS systems such
> that I can speak and the characters can talk. The differentiation
> of which character is saying what can be worked out via graphical
> cues and such. I just want somebody to put the thing in.
Interesting. How close to your own voice does it need to be for the
player to player communication? I fully understand that using a
speech->phoneme->speech method isn't a full reproduction of your
voice, so would it be enough that it has the same patterns and
somewhat same tone as your voice? It might be similar to trying to
establish a conversation on a noisy telephone line - it says it's
Bubba, and it sounds kinda like Bubba, but is it really Bubba? You
can tell at least that it's definitely not Buffy.
> The issue of phonemes as the specific technology is not
> significant to me, any more than whether the database being used
> is relational or object-oriented, so long as it has the
> operational characteristics that I'm after.
Perhaps I'm too much of an engineer, but I see no value in giving
treatment to only the initial conceptual stage of a design and
assuming the rest as idealized black-boxes. Sure, design
requirements drive implementation. However, implementation
possibilities often drive the softer design requirements. A lot of
design focuses on determining just what the limits of these
black-boxes impose on the overall design and the tradeoffs
associated with it.
So in the case of phonemes, the limits it imposes is that it allows
for decent generation of tag data for the speech engine, but at the
cost of not being able to display text on the recipients' side.
That's an awful strong limit if your design has a hard requirement
to give the recipient the choice between the output as either text
or speech. Same type of limits can be derived from RDBS and OODB
foundations and do impact design downstream (and to a lesser extent,
upstream).
Speaking of which, anyone have a good design structure matrix (DSM)
for MMORPGs? Or is this too nascent or wide a field for one to
exist?
TLC
- [TECH] Voice in MO* - Phoneme Decomposition and Reconstruction Ted L. Chen
- [TECH] Voice in MO* - Phoneme Decomposition and Reconstruction Ted L. Chen
- [TECH] Voice in MO* - Phoneme Decomposition and Reconstruction Eli Stevens
- [TECH] Voice in MO* - Phoneme Decomposition and Reconstruction Jon Leonard
- [TECH] Voice in MO* - Phoneme Decomposition and Reconstruction Hans-Henrik Staerfeldt
- [TECH] Voice in MO* - Phoneme Decomposition and Rec onstruction Robert Zubek
- [TECH] Voice in MO* - Phoneme Decomposition and Rec onstruction ceo@grexengine.com
- [TECH] Voice in MO* - Phoneme Decomposition and Reconstruction Hans-Henrik Staerfeldt
- [TECH] Voice in MO* - Phoneme Decomposition and Reconstruction Ted L. Chen
- [TECH] Voice in MO* - Phoneme Decomposition and Rec onstruction Koster, Raph
- [TECH] Voice in MO* - Phoneme Decomposition and Rec onstruction lynx@lynx.purrsia.com
- [TECH] Voice in MO* - Phoneme Decomposition and Reconstruction Ted L. Chen
- [TECH] Voice in MO* - Phoneme Decomposition and Reconstruction Rudy Neeser
- [TECH] Voice in MO* - Phoneme Decomposition and Reconstruction Matt Mihaly
- [TECH] Voice in MO* - Phoneme Decomposition and Rec onstruction Daniel.Harman@barclayscapital.com
- [TECH] Voice in MO* - Phoneme Decomposition and Reconstruction John Buehler
- [TECH] Voice in MO* - Phoneme Decomposition and Reconstruction Ted L. Chen
- [TECH] Voice in MO* - Phoneme Decomposition and Reconstruction Mike Shaver
- [TECH] Voice in MO* - Phoneme Decomposition and Reconstruction John Buehler
- [TECH] Voice in MO* - Phoneme Decomposition and Reconstruction amanda@alfar.com
- [TECH] Voice in MO* - Phoneme Decomposition and Rec onstruction Koster, Raph
- [TECH] Voice in MO* - Phoneme Decomposition and Rec onstruction Steve {Bloo} Daniels
- Combat with Style (was Player Accounts on a Non-Commercial MUD) lynx@lynx.purrsia.com
- Playskins Koster, Raph
- People were talking about resets.. Anderson, David
- People were talking about resets.. Sasha Hart
- People were talking about resets.. Anderson, David
- People were talking about resets.. John Buehler
- People were talking about resets.. Anderson, David
- People were talking about resets.. Arnau Rossell=?US-ASCII?Q?=F3?= Castell=?US-ASCII?Q?=F3?=
- People were talking about resets.. shren
- People were talking about resets.. Anderson, David
- People were talking about resets.. David B. Held
- People were talking about resets.. shren
- People were talking about resets.. Anderson, David
- People were talking about resets.. Michael Tresca
- People were talking about resets.. Fred Clift
- People were talking about resets.. Lars Duening
- People were talking about resets.. Michael Tresca
- People were talking about resets.. Rayzam
- People were talking about resets.. Matt Mihaly
- People were talking about resets.. Tand'a-ur
- People were talking about resets.. John Buehler
- People were talking about resets.. Leland Hulbert II
- People were talking about resets.. lynx@lynx.purrsia.com
- People were talking about resets.. Jason Murdick
- People were talking about resets.. Jeff Lindsey
- People were talking about resets.. Ben Chambers
- People were talking about resets.. Sasha Hart
- People were talking about resets.. David B. Held
- People were talking about resets.. Sasha Hart
- People were talking about resets.. Anderson, David
- People were talking about resets.. David B. Held
- People were talking about resets.. Acius
- People were talking about resets.. Marian Griffith
- Question about copyovers. Anderson, David
- Question about copyovers. Tand'a-ur
- Question about copyovers. Anderson, David
- Question about copyovers. Kwon Ekstrom
- Question about copyovers. Adam
- Question about copyovers. Oliver Jowett
- Question about copyovers. Kwon Ekstrom
- Question about copyovers. Lars Duening
- Question about copyovers. Jon Lambert
- Question about copyovers. fred@clift.org
- Question about copyovers. Zach Collins {Siege}
- Question about copyovers. Smith, David {Lynchburg}
- non-violent activities (was People were talking about resets..) Ammon Lauritzen
- References on personality and emotion models Robert Zubek
- Multimodal interface conference. Rayzam
- Hyperbolies R Us shren
- Hyperbolies R Us Matt Mihaly
- TECH: Systems Administration Issues Thomas Leavitt
- Questions about ... XML as data format Adam
- Questions about ... XML as data format Anderson, David
- Questions about ... XML as data format Kwon Ekstrom
- R&D Matt Mihaly
- Conversation logs? Robert Zubek
- Conversation logs? Rudy Fink
- Conversation logs? Vincent Archer
- Conversation logs? Shane Gough
- DGN: Elastic Advancement in MUDs? Jeff Lindsey
- DGN: Elastic Advancement in MUDs? David B. Held
- Building histories off civilizations automatically adam Martin
- ADMIN: Virii and mail forgeries J C Lawrence
- Hi from the Dragon Empire's CLM Peter Tyson
- Linux gaming ( was Apple WWDC? ) Kevin Mack
- [TECH] Preferred LPC replacement? Jeff Bachtel
- [TECH] Preferred LPC replacement? Damion Schubert
- Who `owns' conversation logs? Joshua Judson Rosen
- The Online Gaming Life for Me! Michael Tresca
- Boredom Ben Chambers
- Law of Diminishing Marginal Utility [was Boredom] Ron Gabbard
- Law of Diminishing Marginal Utility [was Boredom] Caliban Tiresias Darklock
- "MMOG" Bible Brian 'Psychochild' Green
- "MMOG" Bible David Kennerly
- TECH: Single process v.s. multi process? Philip Mak
- TECH: Single process v.s. multi process? Smith, David {Lynchburg}
- TECH: Single process v.s. multi process? Bruce Mitchener
- TECH: Single process v.s. multi process? Bruce Mitchener
- In defense of "soloability" [was Law of Diminishing Marginal Utility] Clay
- In defense of "soloability" [was Law of Diminishing Marginal Utility] apollyon
- In defense of "soloability" [was Law of Diminishing Marginal Utility] Caliban Tiresias Darklock
- In defense of "soloability" [was Law of Diminishi ng Marginal Utility] Koster, Raph
- In defense of "soloability" [was Law of Diminishing Marginal Utility] Michael Tresca
- In defense of "soloability" [was Law of Diminishing Marginal Utility] Paul Schwanz
- In defense of "soloability" [was Law of Diminishing Marginal Utility] Martin C. Martin
- In defense of "soloability" [was Law of Diminishing Marginal Utility] Dave Rickey
- In defense of "soloability" [was Law of Diminishing Marginal Utility] Sanvean
- In defense of "soloability" [was Law of Diminishing Marginal Utility] Matt Mihaly
- In defense of "soloability" [was Law of Diminishing Marginal Utility] Michael Tresca
- In defense of "soloability" [was Law of Diminishing Marginal Utility] Kwon Ekstrom
- In defense of "soloability" [was Law of Diminishing Marginal Utility] Ron Gabbard
- Game shows Peter Tyson
- What keeps people interested in social muds? Martin C. Martin
- What keeps people interested in social muds? Rudy Fink
- What keeps people interested in social muds? Martin C. Martin
- What keeps people interested in social muds? Joshua Judson Rosen
- What keeps people interested in social muds? Ron Gabbard
- What keeps people interested in social muds? Marc Bowden
- What keeps people interested in social muds? Joshua Judson Rosen
- What keeps people interested in social muds? lynx@lynx.purrsia.com
- What keeps people interested in social muds? Richard A. Bartle
- What keeps people interested in social muds? Martin C. Martin
- What keeps people interested in social muds? Richard A. Bartle
- New Beginings Drylar Levre
- New Beginings Acius
- New Beginings ceo@grexengine.com
- New Beginings Bruce Mitchener
- New Beginings David B. Held
- New Beginings Kwon Ekstrom
- New Beginings David B. Held
- New Beginings Kwon Ekstrom
- New Beginings Sean Kelly
- New Beginings Lars Duening
- New Beginings David B. Held
- New Beginings Lars Duening
- New Beginings Bruce Mitchener
- New Beginings David B. Held
- New Beginings Kwon Ekstrom
- New Beginings Paul Schwanz
- New Beginings Zach Collins {Siege}
- New Beginings Miroslav Silovic
- New Beginings David B. Held
- New Beginings Draymoor a Vin il'Rogina
- New Beginings David B. Held
- New Beginings Kwon Ekstrom
- New Beginings Bruce Mitchener
- [DGN] Creating a MUD Richard Krush
- [DGN] Creating a MUD Acius
- [DGN] Creating a MUD Caliban Tiresias Darklock
- [DGN] Creating a MUD Edward Glowacki
- [DGN] Creating a MUD Kwon Ekstrom
- [DGN] Creating a MUD Edward Glowacki
- [DGN] Creating a MUD Fred Clift
- [DGN] Creating a MUD David Bennett
- [DGN] Creating a MUD fred@clift.org
- [DGN] Creating a MUD Damion Schubert
- [DGN] Creating a MUD Taylor
- [DGN] Creating a MUD Matt Mihaly
- [DGN] Creating a MUD Daniel.Harman@barclayscapital.com
- On the creation of constructive/social behaviours in online games! Marc Demesel
- [DGN] MUD Books James Edward Gray II
- [DGN] MUD Books Scion Altera
- [DGN] MUD Books Jeremy Noetzelman
- [DGN] MUD Books Tand'a-ur
- Positive reinforcement for socializing [was In de fense of "soloability" ] Jeff Lindsey
- Character skill distribution and trade-offs Ron Gabbard
- Character skill distribution and trade-offs Daniel.Harman@barclayscapital.com
- Character skill distribution and trade-offs Vincent Archer
- Character skill distribution and trade-offs Daniel.Harman@barclayscapital.com
- Character skill distribution and trade-offs John Buehler
- Character skill distribution and trade-offs Sean Kelly
- Character skill distribution and trade-offs Ron Gabbard
- Space partitioning, R-Trees? Dread Quixadhal
- Space partitioning, R-Trees? Daniel.Harman@barclayscapital.com
- Space partitioning, R-Trees? Hans-Henrik Staerfeldt
- Space partitioning, R-Trees? Crosbie Fitch