WO2008077128A2 - Recombinant music coposition algorithm and method of using the same - Google Patents

Recombinant music coposition algorithm and method of using the same Download PDF

Info

Publication number
WO2008077128A2
WO2008077128A2 PCT/US2007/088212 US2007088212W WO2008077128A2 WO 2008077128 A2 WO2008077128 A2 WO 2008077128A2 US 2007088212 W US2007088212 W US 2007088212W WO 2008077128 A2 WO2008077128 A2 WO 2008077128A2
Authority
WO
WIPO (PCT)
Prior art keywords
musical work
segment
segments
existing
musical
Prior art date
Application number
PCT/US2007/088212
Other languages
French (fr)
Other versions
WO2008077128A9 (en
WO2008077128A3 (en
Inventor
David H. Cope
Original Assignee
Recombinant, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Recombinant, Inc. filed Critical Recombinant, Inc.
Publication of WO2008077128A2 publication Critical patent/WO2008077128A2/en
Publication of WO2008077128A3 publication Critical patent/WO2008077128A3/en
Publication of WO2008077128A9 publication Critical patent/WO2008077128A9/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10HELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H1/00Details of electrophonic musical instruments
    • G10H1/0008Associated control or indicating means
    • G10H1/0025Automatic or semi-automatic music composition, e.g. producing random music, applying rules from music theory or modifying a musical piece

Definitions

  • the present invention provides a recombinant music composition algorithm and method of using the same, and, more particularly a linear retrograde recombinant music composition algorithm and method of using the same.
  • Emmy software 2006. Computer Models of Musical Creativity. Cambridge, MA: MIT Press; [0004]
  • the process used within the Emmy software has been referred to throughout the inventor's work generically as Experiments in Musical Intelligence.
  • the fundamental algorithmic sequence of the Emmy software can be represented by the logic flow illustrated in Figure 1, which shows, from a music database 100, operations that include pattern matching step 110, segmentation step 120, hierarchical analysis step 130, non-linear recombination step 140, which result in the output 150.
  • the music database 100 is essentially the embodiment of a musical composition, or musical performance, in a tangible or legible form, format, language, or code that can be interpreted and executed by devices such as a computer, a musical instrument with a digital interface, a sound synthesizer, digital-to-analog audio reproduction system, or any combination of electrical, electro-mechanical, or mechanical musical devices.
  • a musical score is, in and of itself a musical database of a composition
  • a phonograph recording is an analog musical database of a performance.
  • the groupings of notes in a musical phrase are assigned numerical values according to their pitch, duration, location in the work, voice, amplitude and/or other sonic and temporal qualities which characterize the notes.
  • a single event is the grouping of notes which constitute a single beat in a musical work. Collections of numerical values of notes are compiled to represent successively longer measures, phrases, sections, and so forth. These compilations are called event lists, and are susceptible to processing by digital list-processing computer applications (such as the computer language known as LISP, short for List Processor).
  • Event lists describe the various attributes of each note with a single list of parameters of at least five separate but related elements, as follows:
  • the first element of the event list is the on-time, or the time elapsed between the beginning of the work and the initiation of the note.
  • On-times are assigned numerical values in Emmy based on a standard metric of 1,000 ticks per second, which is usually equated with the length of a quarter note.
  • On-times are relative, not absolute.
  • the actual on-time of a pitch is determined by a combination of on-time (location in the score) and tempo (pace of playing). For example, an on-time of 1,000 could begin 1 second after 0 with a tempo of m.m. 60, 2 seconds after 0 with a tempo of m.m. 30, half a second after 0 with a tempo of m.m. 120, and so forth.
  • Events describe only sound eventsinotQ ons and note offs), not silences or rests, relieving databases of vast amounts of unnecessary data. Silences, or rests, are represented by default as the result of a lack of events.
  • the second entry of the event list is pitch.
  • pitch is assigned a numerical value using the established Musical Instrument Digital Interface ("MIDI") standard, with middle C (520 cycles per second) equal to MIDI note number 60.
  • MIDI Musical Instrument Digital Interface
  • Additions and subtractions of 12 produce C in various octaves, and additions and subtractions of 1 create half steps.
  • the numerical sequence 60-62-64-65-67-69-71-72 represents the C major scale with intervening numbers (61,63,66,68,70) producing chromaticism to that key.
  • the third entry of the event list is duration. Duration, as with on-time, is figured to a quarter note's equaling 1,000 ticks; relative durations are figured from that standard.
  • the duration of an event implies the MIDI note off-time, which can be independently figured as the addition of the on-time plus the duration.
  • an event with an on-time of 6,000 and a duration of 1,000 has an off-time of 7,000.
  • Duration, as with on-time, is relative, being a factor of its value within the current tempo.
  • the fourth entry of the event list is channel number.
  • the channel numbers indicate the original voice separation of the music entered into that database(e.g. : soprano, alto, tenor, bass; or, trumpet, saxophone, guitar, drums, etc.).
  • Channel numbers are used to indicate the voice from which events were harvested or will be assigned for performance in the score of the new composition or, perhaps, for performance by a digitally enabled instrument (e.g. an instrument compatible with the industry-standard Musical Instrument Digital Interface).
  • Channel numbers are theoretically unlimited, but in practice, 64 channels are sufficient for most music.
  • the fifth entry of the event list represents dynamics. Dynamics are based on 0 equaling silence and 127 equaling fortissimo, with the numbers between these values being relative to these extremes.
  • Numbering systems while logical are arbitrary and therefore many alternative numbering systems are possible. For instance, a base metric of 10,000 ticks per second could be used for event duration, and a scale of 0 for silence to 254 for fortissimo. Additional entries, a sixth, seventh and so on, into databases may be made as needed for other musical qualities and quantities pertaining to musical notes or events, such as tremolo, aftertouch, and so forth. Events are open-ended; that is, one may add any desired parameter to the end of event lists with no ill effects on the first five elements.Events are compiled into collections of larger phrase, section, or work, lists. Events are typically ordered sequentially (i.e. beat one, then beat two, and so forth) to make visual event reading simple and logical.
  • Databases can be created by manually translating scores into event lists, or by software that automatically scans printed scores (sheet music) and translates them into events lists, by performing the work through a digitally-enabled instrument (sometimes referred to as "step entry") or by software that automatically analyzes performed music and translates it into events lists, or any combination of these techniques.
  • the Emmy algorithm assumes that every work of music contains an inherent set of instructions, or rules, for creating different but highly-related replications of itself -- an assumption which is generally agreed to by musicologists. These instructions, when analyzed and interpreted correctly, lead to important discoveries about this music's structure as well as providing a key to producing new instances of music that are stylistically-faithful to it.
  • the pattern matching step 1 10 is the process of comparing events lists representing musical works or phrases in the musical database to discover what elements they have in common. Highly recurrent patterns in a single work typically represent thematic material, such as a particular melodic line and associated chord progression. However, patterns which recur in more than one work can be construed as the essence of the style of a particular composer or genre.
  • Style is inherent in recurrent patterns of the relationships between the musical events, in more than one work.
  • the primary constituents of these patterns are the quantities and qualities captured and represented in the musical database event lists - essentially pitch, duration, and temporal location in the work — although other factors such as dynamics and timbre may come into play. Patterns may be discerned in vertical, simultaneous relationships, such as harmony, horizontal, time-based relationships, such as melody, as well as amplitude-based relationships (dynamics) and timbral relationships. Patterns might be identical, almost identical, identical but reversed, identical but inverted, similar but not identical, and so forth.
  • the Emmy algorithm searches the databases for such patterns using controllers that either restrict the search to detecting patterns that are highly similar, or widen the search to detect patterns that are loosely similar.
  • the essence of this process is to reiteratively select the event list of differing portions of the music and look for other instances of the same, or similar, events lists elsewhere in the database, and to compile catalogues of matching events lists, ranking them by frequency of occurrence, type, and degree of similarity.
  • the objective of this search is to detect patterns that characterize the commonalities, or "style,” of the bodies of music in the musical databases.
  • the basic twelve-bar blues form - (AAB) — would be discovered and registered as a match in a database containing several blues.
  • Formal patterns generally of long duration, will often have widely varied content within the components of the pattern.
  • the pattern matching controllers are therefore set to discover and compare larger musical structures while de-emphasizing or ignoring the details within the form.
  • a poetry-form pattern matcher would discover the sonnet form by finding commonalities in the number of lines, meter, and rhyming scheme, while ignoring the words. Even if some sonnets in the database were in English and others were in Italian, the form uniting them could be discovered.
  • Matched patterns of shorter duration such as beats, measures and short phrases, are also sought and catalogued. These commonalities are denominated "signatures" in Emmy. In the recomposition process, signatures are preserved and serve to ensure that stylistic qualities are inherent in the musical output.
  • Emmy In the Emmy algorithm, superficial, thematic material specific to a. particular work should not be mistaken for deeper commonalities shared by many works by the same composer, or in the same genre. For instance the "di-di-di-dah" motif of Beethoven's Fifth Symphony is a thematic component specific to that work and is not a signature that is found in very many or all of Beethoven's work.
  • the pattern-matching controllers in Emmy can be adjusted to reject thematic material as superficial and irrelevant to the discovery of signatures. This is achieved by rejecting matches that occur with relatively high frequency in a single work but occur with relatively low frequency, or are entirely absent, in other works.
  • the essential outcomes of the pattern-matching step 1 10 are two-fold: long forms present in the source material, particularly forms of musical-phrase length and larger, are identified for use as templates for future recomposition; and stylistic signatures are captured so that instances of them can be protected (not broken apart) and re-implanted in the composition process.
  • segmentation step 120 In order to recombine music it must, self-evidently, be broken into constituent elements first. This process is referred to in the Emmy software as segmentation step 120. Segments, typically, consist of beats - the groupings of notes which correspond with one beat in the music. However, segmentation of existing musical works into smaller components, and haphazard recombination of them into new orders, would produce musical gibberish, as would fragmenting written language sentences into words and haphazardly recombining the words without regard to grammar (syntax) or meaning (semantics). Although segmentation in the Emmy software is fundamentally straightforward - the identification of each beat and its conversion into an event list - each segment will become progressively more complex as contextual analysis is applied to it.
  • Each beat-segment will accumulate and carry with it at least the following information: the destination note for each note in the beat (i.e. the note in the corresponding voice which follows it in the original work); the grouping of beats, or phrase, to which it belongs; the location of the phrase within the work; its SPEAC value (see below); and whether it is part of a signature that will be protected and not broken apart in the re-composition process.
  • pattern matching step 110 analyzes form from an essentially syntactical point of reference, hierarchical, or SPEAC
  • analysis step 130 investigates the semantic structure of music and provides tools for ensuring that when music is recombined, syntactically correct music is also semantically intelligible.
  • homonyms can have quite different functions and meanings, for instance in the sentence "I saw the saw saw.”
  • the word "saw” appears three times in this sentence, with each appearance having a different meaning and making a different syntactic contribution (subject verb, object noun, object verb, etc.) and semantic meaning (because we know that saws cannot see, we infer that the final appearance cannot be a part of the verb "to see” and must therefore refer to the act of sawing).
  • Only the context distinguishes each word's true function and meaning. The same is may be said of music.
  • Tonal-music leading tones provide an example of how hierarchical analysis differentiates between apparently identical functional motions in music.
  • the same leading-tone note appearing as the fifth of the mediant triad does not necessarily lean toward the tonic note (C), but in fact often moves more naturally elsewhere — the submediant note (A), for example.
  • the same leading-tone note can be analyzed differently depending on its context. This insight provides a very important foundation for Emmy approaches to structural analysis.
  • Emmy software adopts a hierarchical approach to musical analysis, which is based on a combination of musical tension and musical context that are analyzed, evaluated and assigned a numerical weighting.
  • This weighting combination closely parallels the manner in which one hears music, almost regardless of its style or period of composition, and hence represents the core of the analysis component of Emmy composing programs.
  • the hierarchical approach uses a process that goes by the acronym SPEAC, the acronym being based on the identifiers -Statement (S), Preparation (P), Extension (E), Antecedent (A), and Consequent (C) - which will be assigned to events and groupings of events.
  • SPEAC analysis also parses these selected groupings of events to extract information about their role in increasingly large musical structures, from beats to measures, to phrases, sections, and even to whole works. While traditional tonal functions provide analysis of surface detail, the SPEAC approach provides deeper insights into musical structure. In other words, SPEAC derives musical meanings from context as well as from content.
  • Extension is stable — a continuation of material or ideas. Extensions usually follow statements but can follow any SPEAC function.
  • Antecedent is very unstable — requires a consequent function. Antecedents typically precede consequents.
  • PSEAC and SEA are musically logical, progressions such as AEPS and SAPC, while not impossible, are less logical.
  • David Cope: "Algorithmic Composer" p. 194 provides an example of SPEAC analysis as applied to a Bach Chorale.
  • a chord which produced a total of 0.5 would be the antecedent, "A,” in the context of a preceding chord with a value of 0.2 and a succeeding chord with a value of 0.3; however, the identical chord with its value of 0.5 would be a consequent, "C,” in the context of a preceding 0.8 and a succeeding 0.4. While the intervals in the event determine its fixed value, or weighting, the context in which the event occurs determine its relative position in the SPEAC hierarchy - both in the original works in the database and in recombined works.
  • SPEAC hierarchical analysis step 130 After SPEAC hierarchical analysis step 130 has been performed, every event and grouping of events, will carry with it its SPEAC identifiers and weightings, which will be essential in order to accomplish musically-logical, structurally sound, and context-sensitive re-location of events and event-groupings in the recombination process.
  • the Emmy software employs a recombination step 140, which is made possible by prior pattern-matching step 110, segmentation step 120, and prior SPEAC hierarchical analysis step 130 of the database 100.
  • Non-linear recombination step 140 is the compositional process that synthesizes the results of the pattern-matching step 1 10 (form, and signature detection) and the hierarchical SPEAC analysis step 130 (context-sensitive, structural analysis) components of the Emmy algorithm.
  • Tonal music follows well-known principles governing pitch (notably major and minor scale derivation and complementary chromaticism), melody (primarily stepwise motion with leaps often followed by stepwise contrary motion), harmony (having prescribed functions and syntax), voice-leading (mostly stepwise motion with voice independence), hierarchical form (phrases, sections, and movements governed by logical repetitions, variations, and contrasts), and so on.
  • pitch notably major and minor scale derivation and complementary chromaticism
  • melody primarily stepwise motion with leaps often followed by stepwise contrary motion
  • harmony having prescribed functions and syntax
  • voice-leading mostly stepwise motion with voice independence
  • hierarchical form phrases, sections, and movements governed by logical repetitions, variations, and contrasts
  • Recombinancy is a method for producing new and logical, i.e. musically logical, collections of musical events (i.e. new compositions) by recombining existing data into new logical orders on the basis of the rules which have been acquired through analysis of specific works or bodies of work (as distinct from the imposition of generic rules).
  • the Emmy algorithm assumes that every work, or stylistically consistent body of music, contains an implicit set of instructions, or rules, for creating different but highly-related replications of itself. Consequently, recombinancy, based on rules acquisition (as distinct from rules imposition) provides logical and successful approaches to composing new, highly-related replications of the original work(s).
  • SPEAC analysis One of the most important impacts that SPEAC analysis has on algorithmic composition involves the order in which groupings of events are selected and embedded in new compositions in a way which is faithful to the sets of instructions, both formal (i.e. discovered by pattern-matching analysis), and context sensitive (i.e. identified and evaluated by SPEAC analysis), that have been acquired from the database.
  • SPEAC allows for a nonlinear approach to recombinant composition.
  • Significant (relatively high SPEAC values) antecedent (A) and statement (S) groupings of a new work are selected first, and the remaining groupings follow in SPEAC priority order.
  • key components (groupings of musical events) of a new work may appear in many places within an overall formal structure initially, simultaneously, rather than appearing first at the beginning of a new work in progress and continuing to be added until the end is reached, in a linear manner. This nonlinear process closely resembles how human composers create large-scale works, by envisioning an overall form, or structure, for the work, and then progressively filling in the details.
  • the non-linear recombination step 140 of the Emmy software algorithm is founded on the principles of Augmented Transition Networks (ATNs) widely employed in computational linguistics and adapted in the Emmy software for processing music instead of language.
  • ATNs Augmented Transition Networks
  • the implementation of ATNs in the Emmy software is highly complex, in large part because music, unlike spoken language, has few universal rules of syntax (grammatical rules). Short sequences of musical events do not have commonly-agreed meanings (comparable to the dictionary definition of a part-of-speech made up of letters of the alphabet). Nor do longer musical phrases have established meanings in the way that a sentence usually does. By way of illustration, many English speakers could readily explain how the sentence "The dog eats a bone.
  • Emmy software focuses on rules acquisition from the musical database in use. This enables the Emmy software to powerfully analyze almost any musical database and recompose in any style, rather than be restricted to a single style by a single set of rules.
  • the output 150 of the Emmy software is a musical database with an entire new composition, stylistically faithful to compositions in the original database and resembling them, derived from them but not replicating them, resulting from the integration and synthesis of the process steps described above. It can be manifested as a score to be played by musicians, a digital file which can be input into an electromechanical device, such as a MIDI- enabled instrument or sound synthesizer, or readily converted into any other form of musical expression or performance.
  • an electromechanical device such as a MIDI- enabled instrument or sound synthesizer
  • Emmy software has many advantages in many environments, it is complex and requires considerable memory just to store the executable code. Due to its high- level, AI functions, linguistics-based ATNs and other expert systems, it is also computationally intensive, requiring a very large number of computing cycles to execute the encoded instructions of the algorithm. It is therefore highly unsuited to the rapid, recombinant, re-composition of short musical works, such as would be deployed telephone ringtones, musical toys, videogames, music boxes, and other similar applications requiring rapid and repetitive iterations of new music based on existing bodies of music.
  • the present invention provides a recombinant music composition algorithm and method of using the same.
  • the present invention provides a linear retrograde recombinant music composition algorithm and method of using the same.
  • a method for composing a new musical work based upon a plurality of existing musical work segments Upon providing a plurality of existing musical work segments, a final cadence segment of the new musical work is selected, and thereafter in a retrograde manner, a plurality of musical work segments are also selected.
  • the new musical work is created from end to beginning.
  • FIG. 1 illustrates a flowchart of a prior art musical composition algorithm system
  • FIG. 2 illustrates a flowchart of a musical composition algorithm system according to the present invention
  • FIG. 3 illustrates an example of retrograde recombination according to the present invention.
  • Figs. 4(a) and 4(b)l-2 illustrate a sample music database, and the music corresponding thereto.
  • the linear retrograde recombinant music composition algorithm software according to the present invention is an algorithm which is suited to the rapid re-composition of musical works, especially relatively short musical works, and is preferably written in software that can be deployed in digital, electronic and electro-mechanical devices such as telephone ringtones, musical toys, music boxes, videogames, and other similar applications, already existing or as yet undeveloped, requiring rapid and repetitive iterations of new music based on existing bodies of music.
  • LRRMCA uses simple retrograde recombinant algorithms to eliminate computational intensity in order to quickly produce an output that is stylistically faithful to and resembles existing bodies of music, derived from but not replicating them.
  • the fundamental algorithmic sequence of the LRRMCA can be represented by the logic flow illustrated in Figure 2, which shows, from a music database 200, segmentation step 210, retrograde recombination step 220, which result in the output 230. These steps performed by the LRRMCA are now described in more detail.
  • the music database 200 in LRRMCA has essentially the same aspects as described in the Emmy algorithm software above. Musical events are assigned values for the five basic parameters - on-time, pitch, duration, channel, and dynamics of the five basic parameters at least the first three, on-time, pitch and duration, are essential; the fourth, channel, is necessary for polyphony; and the fifth only if dynamics are a desired quality of the output.
  • the segmentation step 210 in LRRMCA is fundamentally the same as in the Emmy algorithm, with beats forming the basic increment of segmentation of the source works
  • a segment will only incorporate information with respect to the list of events pertaining to the notes in the segment itself and destination note information with respect to the destination notes for the notes in the beat. Thus, a segment will often constitute as little as a single beat of the source music, although longer segments may optionally be selected.
  • destination-note information is preferably the information about the last events in a beat that is selected to precede the current segment during the creation of the new score.
  • SPEAC signature collection and structural
  • retrograde recombination step 220 can occur.
  • retrograde recombination step 220 is performed without recourse to hierarchical SPEAC analysis or ATNs. This retrograde process is founded in the observation that, in the great majority of tonal-music forms and traditions, the end of a musical work is far more critical to the listener's perception that the work makes musical sense than is its beginning. The constraints governing the beginnings of musical phrases are significantly less bound by musical logic and convention than the ends of musical phrases.
  • LRRMCA begins a new musical work by selecting, either randomly or using an algorithm or process as described below, a segment that represents a cadence in its database of existing works. A cadence in music often occurs at the end of a phrase and almost invariably at the end of a work. Through application of such simple criteria cadences are easily identified and can thus be selected to initiate the composition process.
  • LRRMCA in the retrograde recomposition step 220, then composes backwards from the selected cadence, selecting and accumulating segments which have ending notes that correspond with the destination-note information included in the previously selected segment. This process therefore unfolds in a linear, retrograde fashion.
  • An example that illustrates the retrograde recomposition step 220 is diagrammed in Fig. 3, which shows four musical segments of varying lengths, 300-1, 300-2, 300-3, and 300-4, from an initial piece of music 300. Each of these can be construed as a combination of a one-beat-long piece of destination-note information that precedes the remaining beats, or alternatively, initial beats with the destination note information at the end.
  • LRRMCA first selects a cadence segment, in this example the final beat of segment 300-4, in order to begin the process. It then seeks any segment which ends with an end beat corresponding the destination-note first beat of the segment 300-4. In this case it selects segment 300-3, which fulfills this condition. Segment 300-3 is then added to segment 300-4 (removing the destination note and destination note information as discussed above).
  • code that determines the length of the composition for instance, sixteen beats
  • code that prevents the output from exactly replicating the initial piece of music code that prevents the output from exactly replicating prior output
  • code that repeats the output (“looping”
  • code that can connect one piece of musical output with another, different piece of musical output creating longer structures for instance, conventional musical structures such as ABA, ABBA, ABABA, etc.
  • the LRRMCA algorithm does not utilize SPEAC analysis, or any other hierarchical analysis processes requiring or required by non-linear recombination for recomposition purposes; nor are ATNs employed. Note also that no pattern-matching operations have been applied to the database to extract forms to act as skeletons for the new composition(s), nor have signature been extracted for re-embedding, intact, in the new composition(s).
  • the output 230 of LRRMCA process is a phrase-length new musical composition based on existing musical phrases in the musical database, stylistically faithful to them and resembling them, derived from them but not replicating them.
  • LRRMCA output, 230 is a short piece of music, or musical motive — typically but not necessarily from 16 to 32 beats in length
  • Such output along with the computational efficiency and compactness of the algorithm, makes LRRMCA written in software particularly suitable for implementation in electro-mechanical and electronic devices that generate musical works and, or, that produce rapid, and in some cases repetitive, iterations of new music, based on existing bodies of music.
  • Such devices include, but are not limited to, telephone ringtones, musical toys, music boxes, computer-video games, and music-composition workstations.
  • such devices may constitute LRRMCA code executable on the operating system of a computer, or LRRMCA may be embedded in tangible form as hardware in integrated electronic circuitry and, or, "chips," or any other technological process or device, or combination thereof, capable of inputting musical data, executing LRRMCA, and outputting musical data.
  • Figs. 4(a) and 4(b)l-2 illustrate a sample music database, and the music corresponding thereto.

Abstract

The present invention provides a retrograde recombinant composition algorithm that creates new musical compositions based on existing musical compositions that are preferably written in software and is suitable for implementation in electro-mechanical and electronic devices that generate musical works based on existing bodies of music. The retrograde approach to recomposition according to the present invention provides a highly simplified code that executes at a high speed, and accordingly a reduced need for computational resources.

Description

RECOMBINANT MUSIC COMPOSITION ALGORITHM AND METHOD OF USING THE SAME
Field of the Invention
[0001] The present invention provides a recombinant music composition algorithm and method of using the same, and, more particularly a linear retrograde recombinant music composition algorithm and method of using the same.
Background of the Invention
[0002] The practice of algorithmic composition has a long history ranging from mechanical devices (such as wind chimes and automata), through musical dice games (Musicalisches Wurfelschpiel, attributed to Mozart among others), mathematical, statistical, random and stochastic composition (e.g. the works of Iannis Xenakis) to computational software code and programs such as Cybernetic Composer by Charles Ames (see Ames, Charles. 1987. "Automated Composition in Retrospect: 1956-1986." Leonardo 20/2: 169- 185; 1989. "The Markov Process as a Compositional Model: A Survey and Tutorial." Leonardo 22/2: 175-187.). These antecedents have been extensively surveyed in Computers and Musical Style (Cope 1991. Computers and Musical Style. Madison, WI: A-R Editions, pp 1-18).
[0003] While there are various patents that have aspects related to the topic of music composition, the inventor's own prior work is of more significance relative to the present invention. In particular, the inventors own work, commonly known as the "Emmy Algorithm" or "Emmy", is a software package that uses recombinant algorithmic composition and has been taught through the publication of volumes of work, including books and articles, by David Cope, including:
1991a. Computers and Musical Style. Madison, WI: A-R Editions;
1991b. "Recombinant Music." Computer Music Journal 2AIl: 22-28;
1992. "Computer Modeling of Musical Intelligence in EMI." Computer Music Journal 16/2: 69-83;
1996. Experiments in Musical Intelligence. Madison, WI: A-R Editions;
2000. The Algorithmic Composer. Madison, WI: A-R Editions;
2001. Virtual Music. Cambridge, MA: MIT Press; 2003. "Computer Analysis of Musical Allusions." Computer Music Journal 211 \: 11-28;
2006. Computer Models of Musical Creativity. Cambridge, MA: MIT Press; [0004] The process used within the Emmy software has been referred to throughout the inventor's work generically as Experiments in Musical Intelligence. The fundamental algorithmic sequence of the Emmy software can be represented by the logic flow illustrated in Figure 1, which shows, from a music database 100, operations that include pattern matching step 110, segmentation step 120, hierarchical analysis step 130, non-linear recombination step 140, which result in the output 150.
[0005] The music database 100 is essentially the embodiment of a musical composition, or musical performance, in a tangible or legible form, format, language, or code that can be interpreted and executed by devices such as a computer, a musical instrument with a digital interface, a sound synthesizer, digital-to-analog audio reproduction system, or any combination of electrical, electro-mechanical, or mechanical musical devices. For instance, a musical score is, in and of itself a musical database of a composition, and a phonograph recording is an analog musical database of a performance. In an Emmy database, the groupings of notes in a musical phrase, or more precisely the "events" in a musical phrase, are assigned numerical values according to their pitch, duration, location in the work, voice, amplitude and/or other sonic and temporal qualities which characterize the notes. A single event is the grouping of notes which constitute a single beat in a musical work. Collections of numerical values of notes are compiled to represent successively longer measures, phrases, sections, and so forth. These compilations are called event lists, and are susceptible to processing by digital list-processing computer applications (such as the computer language known as LISP, short for List Processor).
[0006] Event lists describe the various attributes of each note with a single list of parameters of at least five separate but related elements, as follows:
[0007] The first element of the event list is the on-time, or the time elapsed between the beginning of the work and the initiation of the note. On-times are assigned numerical values in Emmy based on a standard metric of 1,000 ticks per second, which is usually equated with the length of a quarter note. On-times are relative, not absolute. As with printed music, the actual on-time of a pitch is determined by a combination of on-time (location in the score) and tempo (pace of playing). For example, an on-time of 1,000 could begin 1 second after 0 with a tempo of m.m. 60, 2 seconds after 0 with a tempo of m.m. 30, half a second after 0 with a tempo of m.m. 120, and so forth. Events describe only sound eventsinotQ ons and note offs), not silences or rests, relieving databases of vast amounts of unnecessary data. Silences, or rests, are represented by default as the result of a lack of events.
10008] The second entry of the event list is pitch. In Emmy, pitch is assigned a numerical value using the established Musical Instrument Digital Interface ("MIDI") standard, with middle C (520 cycles per second) equal to MIDI note number 60. Additions and subtractions of 12 produce C in various octaves, and additions and subtractions of 1 create half steps. Thus, the numerical sequence 60-62-64-65-67-69-71-72 represents the C major scale with intervening numbers (61,63,66,68,70) producing chromaticism to that key. [0009] The third entry of the event list is duration. Duration, as with on-time, is figured to a quarter note's equaling 1,000 ticks; relative durations are figured from that standard. The duration of an event implies the MIDI note off-time, which can be independently figured as the addition of the on-time plus the duration. Thus, an event with an on-time of 6,000 and a duration of 1,000 has an off-time of 7,000. Duration, as with on-time, is relative, being a factor of its value within the current tempo.
[00010] The fourth entry of the event list is channel number. The channel numbers indicate the original voice separation of the music entered into that database(e.g. : soprano, alto, tenor, bass; or, trumpet, saxophone, guitar, drums, etc.). Channel numbers are used to indicate the voice from which events were harvested or will be assigned for performance in the score of the new composition or, perhaps, for performance by a digitally enabled instrument (e.g. an instrument compatible with the industry-standard Musical Instrument Digital Interface). Channel numbers are theoretically unlimited, but in practice, 64 channels are sufficient for most music.
[00011] The fifth entry of the event list represents dynamics. Dynamics are based on 0 equaling silence and 127 equaling fortissimo, with the numbers between these values being relative to these extremes.
100012] Numbering systems, while logical are arbitrary and therefore many alternative numbering systems are possible. For instance, a base metric of 10,000 ticks per second could be used for event duration, and a scale of 0 for silence to 254 for fortissimo. Additional entries, a sixth, seventh and so on, into databases may be made as needed for other musical qualities and quantities pertaining to musical notes or events, such as tremolo, aftertouch, and so forth. Events are open-ended; that is, one may add any desired parameter to the end of event lists with no ill effects on the first five elements.Events are compiled into collections of larger phrase, section, or work, lists. Events are typically ordered sequentially (i.e. beat one, then beat two, and so forth) to make visual event reading simple and logical. Databases can be created by manually translating scores into event lists, or by software that automatically scans printed scores (sheet music) and translates them into events lists, by performing the work through a digitally-enabled instrument (sometimes referred to as "step entry") or by software that automatically analyzes performed music and translates it into events lists, or any combination of these techniques.
[00013] As with any large collection of data, in order for a database incorporating many musical events to be manageable it is beneficial to clarify, or make the data homogeneous in ways that preserve its essential characteristics and variety while facilitating analysis and processing. For instance, all works may be transposed into the same key signature, tempo, and so forth, without radically altering their distinctive melodic and harmonic characteristics and intervalic relationships between their notes. The precise format of the list will be determined by the type of application which will be used to process the data. An example of an event list written in a form suitable for use in conjunction with a computational list processing application, such a LISP, is provide in Figure 2a.
[00014] The Emmy algorithm assumes that every work of music contains an inherent set of instructions, or rules, for creating different but highly-related replications of itself -- an assumption which is generally agreed to by musicologists. These instructions, when analyzed and interpreted correctly, lead to important discoveries about this music's structure as well as providing a key to producing new instances of music that are stylistically-faithful to it. [00015] The pattern matching step 1 10 is the process of comparing events lists representing musical works or phrases in the musical database to discover what elements they have in common. Highly recurrent patterns in a single work typically represent thematic material, such as a particular melodic line and associated chord progression. However, patterns which recur in more than one work can be construed as the essence of the style of a particular composer or genre. Style is inherent in recurrent patterns of the relationships between the musical events, in more than one work. The primary constituents of these patterns are the quantities and qualities captured and represented in the musical database event lists - essentially pitch, duration, and temporal location in the work — although other factors such as dynamics and timbre may come into play. Patterns may be discerned in vertical, simultaneous relationships, such as harmony, horizontal, time-based relationships, such as melody, as well as amplitude-based relationships (dynamics) and timbral relationships. Patterns might be identical, almost identical, identical but reversed, identical but inverted, similar but not identical, and so forth. The Emmy algorithm searches the databases for such patterns using controllers that either restrict the search to detecting patterns that are highly similar, or widen the search to detect patterns that are loosely similar. The essence of this process is to reiteratively select the event list of differing portions of the music and look for other instances of the same, or similar, events lists elsewhere in the database, and to compile catalogues of matching events lists, ranking them by frequency of occurrence, type, and degree of similarity. The objective of this search, whether the pattern- matching net is cast tightly or widely, is to detect patterns that characterize the commonalities, or "style," of the bodies of music in the musical databases. [00016] Matches that are long in duration and loosely similar, for instance, characterize the form which the works in the database share. In a rudimentary example, the basic twelve-bar blues form - (AAB) — would be discovered and registered as a match in a database containing several blues. Formal patterns, generally of long duration, will often have widely varied content within the components of the pattern. The pattern matching controllers are therefore set to discover and compare larger musical structures while de-emphasizing or ignoring the details within the form. (By analogy, a poetry-form pattern matcher would discover the sonnet form by finding commonalities in the number of lines, meter, and rhyming scheme, while ignoring the words. Even if some sonnets in the database were in English and others were in Italian, the form uniting them could be discovered.) [00017] Matched patterns of shorter duration, such as beats, measures and short phrases, are also sought and catalogued. These commonalities are denominated "signatures" in Emmy. In the recomposition process, signatures are preserved and serve to ensure that stylistic qualities are inherent in the musical output.
[00018] In the Emmy algorithm, superficial, thematic material specific to a. particular work should not be mistaken for deeper commonalities shared by many works by the same composer, or in the same genre. For instance the "di-di-di-dah" motif of Beethoven's Fifth Symphony is a thematic component specific to that work and is not a signature that is found in very many or all of Beethoven's work. The pattern-matching controllers in Emmy can be adjusted to reject thematic material as superficial and irrelevant to the discovery of signatures. This is achieved by rejecting matches that occur with relatively high frequency in a single work but occur with relatively low frequency, or are entirely absent, in other works. [00019] The essential outcomes of the pattern-matching step 1 10 are two-fold: long forms present in the source material, particularly forms of musical-phrase length and larger, are identified for use as templates for future recomposition; and stylistic signatures are captured so that instances of them can be protected (not broken apart) and re-implanted in the composition process.
[000201 In order to recombine music it must, self-evidently, be broken into constituent elements first. This process is referred to in the Emmy software as segmentation step 120. Segments, typically, consist of beats - the groupings of notes which correspond with one beat in the music. However, segmentation of existing musical works into smaller components, and haphazard recombination of them into new orders, would produce musical gibberish, as would fragmenting written language sentences into words and haphazardly recombining the words without regard to grammar (syntax) or meaning (semantics). Although segmentation in the Emmy software is fundamentally straightforward - the identification of each beat and its conversion into an event list - each segment will become progressively more complex as contextual analysis is applied to it. Each beat-segment will accumulate and carry with it at least the following information: the destination note for each note in the beat (i.e. the note in the corresponding voice which follows it in the original work); the grouping of beats, or phrase, to which it belongs; the location of the phrase within the work; its SPEAC value (see below); and whether it is part of a signature that will be protected and not broken apart in the re-composition process.
[00021] While the pattern matching step 110 analyzes form from an essentially syntactical point of reference, hierarchical, or SPEAC, analysis step 130 investigates the semantic structure of music and provides tools for ensuring that when music is recombined, syntactically correct music is also semantically intelligible.
100022] The differentiation of two or more apparently identical but functionally different musical events by analyzing the context in which they occur is extremely important in the Emmy algorithm. The musical function of a note, or chord, in a piece of music depends on its context, particularly the musical interval between the notes or chords that precede or follow it. This may not be intuitively obvious to a non-musician, but can be illustrated by an analogy. In spoken language, homonyms (same-spelled and spoken words) can have quite different functions and meanings, for instance in the sentence "I saw the saw saw." The word "saw" appears three times in this sentence, with each appearance having a different meaning and making a different syntactic contribution (subject verb, object noun, object verb, etc.) and semantic meaning (because we know that saws cannot see, we infer that the final appearance cannot be a part of the verb "to see" and must therefore refer to the act of sawing). Only the context distinguishes each word's true function and meaning. The same is may be said of music.
100023] Tonal-music leading tones provide an example of how hierarchical analysis differentiates between apparently identical functional motions in music. The leading-tone note in the key of C Major (B), for example, strongly leans toward the tonic note when found in dominant, dominant-seventh, and leading-tone harmonies. However, the same leading-tone note appearing as the fifth of the mediant triad does not necessarily lean toward the tonic note (C), but in fact often moves more naturally elsewhere — the submediant note (A), for example. Thus, the same leading-tone note can be analyzed differently depending on its context. This insight provides a very important foundation for Emmy approaches to structural analysis.
[000241 For these reasons the Emmy software adopts a hierarchical approach to musical analysis, which is based on a combination of musical tension and musical context that are analyzed, evaluated and assigned a numerical weighting. This weighting combination closely parallels the manner in which one hears music, almost regardless of its style or period of composition, and hence represents the core of the analysis component of Emmy composing programs. The hierarchical approach uses a process that goes by the acronym SPEAC, the acronym being based on the identifiers -Statement (S), Preparation (P), Extension (E), Antecedent (A), and Consequent (C) - which will be assigned to events and groupings of events. SPEAC analysis also parses these selected groupings of events to extract information about their role in increasingly large musical structures, from beats to measures, to phrases, sections, and even to whole works. While traditional tonal functions provide analysis of surface detail, the SPEAC approach provides deeper insights into musical structure. In other words, SPEAC derives musical meanings from context as well as from content. SPEAC identifiers function in the following ways: [00025] S = Statement; is stable — a declaration of material or ideas. Statements typically precede or follow any SPEAC function.
[00026] P = Preparation; is unstable — an introductory gesture. Preparations precede any
SPEAC function though more typically occur prior to statements and antecedents.
[00027] E = Extension; is stable — a continuation of material or ideas. Extensions usually follow statements but can follow any SPEAC function.
[00028] A = Antecedent; is very unstable — requires a consequent function. Antecedents typically precede consequents.
[00029] C = Consequent; is very stable — results in consequent gestures. Consequents must be preceded directly or indirectly (with intervening extensions) by antecedents.
[00030] SPEAC identifier assignments follow an A - P - E - S - C stability order with the most unstable identifier to the left, and the most stable identifier to the right. Therefore, A and P require resolution while E, S, and C do not. Thus, progressions of identifiers such as
PSEAC and SEA are musically logical, progressions such as AEPS and SAPC, while not impossible, are less logical. David Cope: "Algorithmic Composer" p. 194 provides an example of SPEAC analysis as applied to a Bach Chorale.
[00031] While these approaches to defining roots, tensions, and groupings seem logical in principle, methods of converting these roots and tensions into numerical values for contextual comparison and analysis is not obvious. To achieve conversion, the Emmy software uses an empirically derived formula:
/(x) = y + (cos ((-l * z) + x/z))/2 where x is the pitch-class interval, y represents the y coordinate, and z is a constant. This formula roughly accounts for the primary intervals (seconds, thirds, and fourths).
[00032] Secondary intervals (sixths and sevenths as inversions of thirds and seconds respectively) then approximately mirror the primary intervals, with the fifth is treated uniquely as it has very little tension. Intervals greater than an octave have slightly less (.02) tension than their related less-than-octave-separated equivalents, because of their octave separation. When this formula is applied to the intervals within several chords in a particular work, the relative totals produced by the formula will indicate what the SPEAC role of each chord is. For instance, a chord which produced a total of 0.5 would be the antecedent, "A," in the context of a preceding chord with a value of 0.2 and a succeeding chord with a value of 0.3; however, the identical chord with its value of 0.5 would be a consequent, "C," in the context of a preceding 0.8 and a succeeding 0.4. While the intervals in the event determine its fixed value, or weighting, the context in which the event occurs determine its relative position in the SPEAC hierarchy - both in the original works in the database and in recombined works.
[00033] After SPEAC hierarchical analysis step 130 has been performed, every event and grouping of events, will carry with it its SPEAC identifiers and weightings, which will be essential in order to accomplish musically-logical, structurally sound, and context-sensitive re-location of events and event-groupings in the recombination process. [00034] The Emmy software employs a recombination step 140, which is made possible by prior pattern-matching step 110, segmentation step 120, and prior SPEAC hierarchical analysis step 130 of the database 100. Non-linear recombination step 140 is the compositional process that synthesizes the results of the pattern-matching step 1 10 (form, and signature detection) and the hierarchical SPEAC analysis step 130 (context-sensitive, structural analysis) components of the Emmy algorithm.
[00035] Tonal music follows well-known principles governing pitch (notably major and minor scale derivation and complementary chromaticism), melody (primarily stepwise motion with leaps often followed by stepwise contrary motion), harmony (having prescribed functions and syntax), voice-leading (mostly stepwise motion with voice independence), hierarchical form (phrases, sections, and movements governed by logical repetitions, variations, and contrasts), and so on. One way to algorithmically create tonal music is by programming rules for each of these principles. Unfortunately, as has been shown (see David Cope: 2001a), this rules-based approach produces technically correct, but musically stale imitations. Additionally, this approach requires programming a new set of rules for every composer or genre of tonal music under consideration.
[00036] Recombinancy, on the other hand, is a method for producing new and logical, i.e. musically logical, collections of musical events (i.e. new compositions) by recombining existing data into new logical orders on the basis of the rules which have been acquired through analysis of specific works or bodies of work (as distinct from the imposition of generic rules).
[00037] Recombinancy appears ubiquitously as natural processes as well as a human creative process. As a simple human-creative example, all the great works of literature in the English language result from combination of the twenty-six letters of the alphabet into words, and recombination of those words. Similarly, most of the great works of Western art music consist of combinations of the twelve pitches of the equal-tempered scale, their octave equivalents, and the recombinations of groupings — melodies, harmonies, and so on — that result from these combinations.
[00038] As stated previously, the Emmy algorithm assumes that every work, or stylistically consistent body of music, contains an implicit set of instructions, or rules, for creating different but highly-related replications of itself. Consequently, recombinancy, based on rules acquisition (as distinct from rules imposition) provides logical and successful approaches to composing new, highly-related replications of the original work(s). [00039] One of the most important impacts that SPEAC analysis has on algorithmic composition involves the order in which groupings of events are selected and embedded in new compositions in a way which is faithful to the sets of instructions, both formal (i.e. discovered by pattern-matching analysis), and context sensitive (i.e. identified and evaluated by SPEAC analysis), that have been acquired from the database. SPEAC allows for a nonlinear approach to recombinant composition. Significant (relatively high SPEAC values) antecedent (A) and statement (S) groupings of a new work are selected first, and the remaining groupings follow in SPEAC priority order. In short, key components (groupings of musical events) of a new work may appear in many places within an overall formal structure initially, simultaneously, rather than appearing first at the beginning of a new work in progress and continuing to be added until the end is reached, in a linear manner. This nonlinear process closely resembles how human composers create large-scale works, by envisioning an overall form, or structure, for the work, and then progressively filling in the details.
[00040] The non-linear recombination step 140 of the Emmy software algorithm is founded on the principles of Augmented Transition Networks (ATNs) widely employed in computational linguistics and adapted in the Emmy software for processing music instead of language. The implementation of ATNs in the Emmy software is highly complex, in large part because music, unlike spoken language, has few universal rules of syntax (grammatical rules). Short sequences of musical events do not have commonly-agreed meanings (comparable to the dictionary definition of a part-of-speech made up of letters of the alphabet). Nor do longer musical phrases have established meanings in the way that a sentence usually does. By way of illustration, many English speakers could readily explain how the sentence "The dog eats a bone. " is grammatically-correct and also make sense, whereas the sentence "The dog sang a bone" although equally grammatically correct is illogical. But most listeners and even musicians would find it difficult if not impossible to explain why one piece of music might make less musical sense than another structurally similar one. However, music does share with language such qualities as form (poetic literary forms have much in common with musical forms) and style (the characteristics of certain writers can be recognized and to some extent defined). Music also has rules of syntax, but they are not universal. Rather, they are specific to genres, composers, and in some case even to individual works. Music is not a language, but many languages, and many families of languages with countless idioms and dialects. It is precisely because there is no universal set of rules that can be discovered in all music that the Emmy software focuses on rules acquisition from the musical database in use. This enables the Emmy software to powerfully analyze almost any musical database and recompose in any style, rather than be restricted to a single style by a single set of rules.
[00041] The output 150 of the Emmy software is a musical database with an entire new composition, stylistically faithful to compositions in the original database and resembling them, derived from them but not replicating them, resulting from the integration and synthesis of the process steps described above. It can be manifested as a score to be played by musicians, a digital file which can be input into an electromechanical device, such as a MIDI- enabled instrument or sound synthesizer, or readily converted into any other form of musical expression or performance.
[00042] While the Emmy software has many advantages in many environments, it is complex and requires considerable memory just to store the executable code. Due to its high- level, AI functions, linguistics-based ATNs and other expert systems, it is also computationally intensive, requiring a very large number of computing cycles to execute the encoded instructions of the algorithm. It is therefore highly unsuited to the rapid, recombinant, re-composition of short musical works, such as would be deployed telephone ringtones, musical toys, videogames, music boxes, and other similar applications requiring rapid and repetitive iterations of new music based on existing bodies of music.
Summary of the Invention 100043] In one aspect, the present invention provides a recombinant music composition algorithm and method of using the same.
100044) In another aspect, the present invention provides a linear retrograde recombinant music composition algorithm and method of using the same.
100045) In one embodiment, there is described a method for composing a new musical work based upon a plurality of existing musical work segments. Upon providing a plurality of existing musical work segments, a final cadence segment of the new musical work is selected, and thereafter in a retrograde manner, a plurality of musical work segments are also selected.
[000461 In a particular embodiment, the new musical work is created from end to beginning.
Brief Description of the Drawings
100047) These and other aspects and features of the present invention will become apparent to those of ordinary skill in the art upon review of the following description of specific embodiments of the invention in conjunction with the accompanying figures, wherein:
100048) Fig. 1 illustrates a flowchart of a prior art musical composition algorithm system;
100049) Fig. 2 illustrates a flowchart of a musical composition algorithm system according to the present invention;
100050) Fig. 3 illustrates an example of retrograde recombination according to the present invention; and
100051) Figs. 4(a) and 4(b)l-2 illustrate a sample music database, and the music corresponding thereto.
Detailed Description of the Preferred Embodiments
[00052] The linear retrograde recombinant music composition algorithm software according to the present invention (termed LRRMCA for shorthand herein), is an algorithm which is suited to the rapid re-composition of musical works, especially relatively short musical works, and is preferably written in software that can be deployed in digital, electronic and electro-mechanical devices such as telephone ringtones, musical toys, music boxes, videogames, and other similar applications, already existing or as yet undeveloped, requiring rapid and repetitive iterations of new music based on existing bodies of music. LRRMCA uses simple retrograde recombinant algorithms to eliminate computational intensity in order to quickly produce an output that is stylistically faithful to and resembles existing bodies of music, derived from but not replicating them. While the LRRMCA uses comparatively trivial computational resources to produce musical output, the output is less reliably faithful to the works in the musical database. However, LRRMCA's almost instantaneous speed of production compensates for any loss of verisimilitude, as does the ease and speed of disposability and replacement of undesirable output. In most applications, such as telephone ringtones, or the generation of advertising "jingles," unsatisfactory output can be discarded without significant cost.
[000531 The fundamental algorithmic sequence of the LRRMCA can be represented by the logic flow illustrated in Figure 2, which shows, from a music database 200, segmentation step 210, retrograde recombination step 220, which result in the output 230. These steps performed by the LRRMCA are now described in more detail. [00054] The music database 200 in LRRMCA has essentially the same aspects as described in the Emmy algorithm software above. Musical events are assigned values for the five basic parameters - on-time, pitch, duration, channel, and dynamics of the five basic parameters at least the first three, on-time, pitch and duration, are essential; the fourth, channel, is necessary for polyphony; and the fifth only if dynamics are a desired quality of the output.
[00055] The segmentation step 210 in LRRMCA is fundamentally the same as in the Emmy algorithm, with beats forming the basic increment of segmentation of the source works
[00056] During the creation of the music, various segments are connected together in order to achieve the new musical score. As will be described in more detail hereinafter, when connecting together two pieces of music, the note at the end of one piece and the note at the beginning of the next piece will be same, as described further herein (though this can also be viewed as the note at the beginning of one piece and the end of what will become the previous piece will be the same). As such, in the resulting work, either the note and the beginning of one piece or the end of the other piece will be dropped, though this note is included in the stored file in some manner. It can also be viewed as an extra note that will not be used in the resulting work. This note is termed herein destination note, and has associated with it destination note information, and is part of what is referred to herein as part of the segment, though it is a part of the segment that will not be included in the resulting work.
100057] A segment will only incorporate information with respect to the list of events pertaining to the notes in the segment itself and destination note information with respect to the destination notes for the notes in the beat. Thus, a segment will often constitute as little as a single beat of the source music, although longer segments may optionally be selected. In LRRMCA destination-note information is preferably the information about the last events in a beat that is selected to precede the current segment during the creation of the new score. Because pattern-matching, signature collection and structural (SPEAC) analysis do not take place in LRRMCA, the events lists which represent the segments in LRRMCA are very lean and contain only the information minimally necessary for retrograde recombination of short works lacking structural complexity. Since computational resources needed to analyze data grow in roughly exponential proportion to number of data elements simultaneously involved in the computation, this reduction in scale and complexity of the segmented database results in order-of-magnitude increases in speed of processing and, or, reduction in needed computational resources.
[00058] After segmentation step 210 is performed on a piece or pieces of music, retrograde recombination step 220 can occur. In contrast to the Emmy algorithm previously described, retrograde recombination step 220 is performed without recourse to hierarchical SPEAC analysis or ATNs. This retrograde process is founded in the observation that, in the great majority of tonal-music forms and traditions, the end of a musical work is far more critical to the listener's perception that the work makes musical sense than is its beginning. The constraints governing the beginnings of musical phrases are significantly less bound by musical logic and convention than the ends of musical phrases. This is because, whereas the listener has few, if any, previously developed expectation of what will happen when a piece of music begins, towards its end many expectations have been developed as the listener has accumulated awareness of certain structural characteristics such as key, time-signature, melodic and harmonic progression, and so forth, which collectively invoke musically- proscribed cadential resolutions. Hence, LRRMCA begins a new musical work by selecting, either randomly or using an algorithm or process as described below, a segment that represents a cadence in its database of existing works. A cadence in music often occurs at the end of a phrase and almost invariably at the end of a work. Through application of such simple criteria cadences are easily identified and can thus be selected to initiate the composition process. LRRMCA, in the retrograde recomposition step 220, then composes backwards from the selected cadence, selecting and accumulating segments which have ending notes that correspond with the destination-note information included in the previously selected segment. This process therefore unfolds in a linear, retrograde fashion. [00059] An example that illustrates the retrograde recomposition step 220 is diagrammed in Fig. 3, which shows four musical segments of varying lengths, 300-1, 300-2, 300-3, and 300-4, from an initial piece of music 300. Each of these can be construed as a combination of a one-beat-long piece of destination-note information that precedes the remaining beats, or alternatively, initial beats with the destination note information at the end. segment As illustrated, the end beats 300-le, 300-2e, and 300-3e in each of these segments 300-1, 300-2, and 300-3 exactly coincide with the destination note first beats 300-2f, 300-3f and 300-4f of the segments 300-2, 300-3 and 300-4. In the retrograde recomposition process, LRRMCA first selects a cadence segment, in this example the final beat of segment 300-4, in order to begin the process. It then seeks any segment which ends with an end beat corresponding the destination-note first beat of the segment 300-4. In this case it selects segment 300-3, which fulfills this condition. Segment 300-3 is then added to segment 300-4 (removing the destination note and destination note information as discussed above). To avoid undesirable repetition, the end beat of 300-3 is eliminated, not used for some period of time, or processed with an algorithm implemented in software code of the type described above. This process is repeated until the final segment, 300-1, is accumulated. This example illustrates both the simplicity of, as well as the powerfulness of, the linear retrograde combination according to the present invention.
[00060] In a database consisting of numerous segments of an initial piece of music 300, there will typically be more than one instance of segments that have end beats that exactly coincide with first-beat destination notes of other segments. In Fig. 3, for instance, the end beats of both 300-1 and 300-3, i.e. 300-le and 300-3e, coincide exactly with the first beat of 300-4f. Therefore, segment 300-1 could have been selected instead of 300-3 to precede segment 300-4, as could any other segment with the same end beat. Random selection of which of the eligible beats will precede segment 300-4, i.e. 300-1 or 300-3, enables LRRMCA to produce varied musical output. Refinements to the retrograde recomposition step 220 illustrated in Fig. 3 that may be input by a user or embodied in software or hardware, include but are not limited to: code that determines the length of the composition (for instance, sixteen beats); code that prevents the output from exactly replicating the initial piece of music; code that prevents the output from exactly replicating prior output; code that repeats the output ("looping"); and code that can connect one piece of musical output with another, different piece of musical output creating longer structures (for instance, conventional musical structures such as ABA, ABBA, ABABA, etc.) and other refinements such as would be evident to one ordinarily skilled in the art of music composition. It is preferable to have the selection of segments, particularly the selection of segments after the initial (end) cadence segment, automatically selected based upon operation of the software or hardware, without requiring further intervention by the user.
[00061] Note that the LRRMCA algorithm does not utilize SPEAC analysis, or any other hierarchical analysis processes requiring or required by non-linear recombination for recomposition purposes; nor are ATNs employed. Note also that no pattern-matching operations have been applied to the database to extract forms to act as skeletons for the new composition(s), nor have signature been extracted for re-embedding, intact, in the new composition(s).
[00062] The output 230 of LRRMCA process is a phrase-length new musical composition based on existing musical phrases in the musical database, stylistically faithful to them and resembling them, derived from them but not replicating them. Generally short in duration and lacking complexity, LRRMCA output, 230, is a short piece of music, or musical motive — typically but not necessarily from 16 to 32 beats in length Such output, along with the computational efficiency and compactness of the algorithm, makes LRRMCA written in software particularly suitable for implementation in electro-mechanical and electronic devices that generate musical works and, or, that produce rapid, and in some cases repetitive, iterations of new music, based on existing bodies of music. Such devices include, but are not limited to, telephone ringtones, musical toys, music boxes, computer-video games, and music-composition workstations. Operationally, such devices may constitute LRRMCA code executable on the operating system of a computer, or LRRMCA may be embedded in tangible form as hardware in integrated electronic circuitry and, or, "chips," or any other technological process or device, or combination thereof, capable of inputting musical data, executing LRRMCA, and outputting musical data. [000631 Figs. 4(a) and 4(b)l-2 illustrate a sample music database, and the music corresponding thereto.
[000641 Although the present invention has been particularly described with reference to embodiments thereof, it should be readily apparent to those of ordinary skill in the art that various changes, modifications and substitutes are intended within the form and details thereof, without departing from the spirit and scope of the invention. For example, while the present invention is preferably implemented using a retrograde linear algorithm, the present invention can also be implemented to create a new musical work from beginning to end, as well as create a new musical work from both the beginning and the end, and then connects the beginning segments and the end segments together using a connecting musical work segment. Accordingly, it will be appreciated that in numerous instances some features of the invention will be employed without a corresponding use of other features. Further, those skilled in the art will understand that variations can be made in the number and arrangement of components illustrated in the above figures. It is intended that the scope of the appended claims include such changes and modifications.

Claims

What is claimed:
1. A method for composing a new musical work based upon a plurality of existing musical work segments comprising the steps of: providing the plurality of existing musical work segments, each of the existing musical work segments including pitch, duration, and on-time, each of the plurality of existing musical work segments further including a destination beat that is used to determine a correspondence with others of the plurality of existing musical work segments; selecting a final cadence segment of the new musical work from the plurality of existing musical work segments; and sequentially selecting, in a retrograde manner, a plurality of musical work segments from the plurality of existing musical work segments such that, for each sequentially selected segment, a last beat of the selected segment corresponds to a destination beat of a previously selected segment, wherein the final cadence segment is an initial previously selected segment.
2. The method according to claim 1 wherein the step of sequentially selecting selects the plurality of musical work segments such that the new musical work is composed from end to beginning.
3. The method according to claim 2 wherein the step of sequentially selecting occurs automatically.
4. The method according to claim 4 wherein the step of sequentially selecting selects at least 8 musical work segments.
5. The method according to claim 1 further including the steps of: selecting an initial segment of the new musical work from the plurality of existing musical work segments; sequentially selecting another plurality of musical work segments from the plurality of existing musical work segments such that, for each sequentially selected segment, a first beat of the selected segment corresponds to a destination beat of a subsequent segment, wherein the initial segment is an initial selected segment; and connecting the plurality of musical work segments and the another pluralitiy of musical work segments with a further musical work segment to create the new musical work.
6. The method according to claim 5 wherein the steps of sequentially selecting and connecting occur automatically.
7. The method according to claim 1 wherein the step of providing provides, for at least some of the existing musical work segments, a voice (channel) and dynamics of at least some of the notes.
8. The method according to claim 1 wherein the step of providing provides each of the plurality of existing musical work segments in a form that includes events lists for the notes of the beat or beats in the existing musical work segment as well as the events list for the notes of the beat that preceded the existing musical work segment in music from which that existing musical work segment came.
9. The method according to claim 1 wherein the step of sequentially selecting selects at least 8 musical work segments.
10. The method according to claim 1, wherein the step of providing provides the plurality of existing musical work segments in a database, and the step of sequentially selecting is implemented in software that is automatically executed by a processor.
1 1. The method according to claim 1, wherein the step of providing provides the plurality of existing musical work segments in a database, and the step of sequentially selecting is programmed into a circuit.
12. A method for composing a new musical work based upon a plurality of existing musical work segments comprising the steps of: providing the plurality of existing musical work segments, each of the existing musical work segments including pitch, duration, and on-time, each of the plurality of existing musical work segments further including a destination beat that is used to determine a correspondence with others of the plurality of existing musical work segments; selecting an initial segment of the new musical work from the plurality of existing musical work segments; and sequentially selecting a plurality of existing musical work segments such that, for each sequentially selected segment, a first beat of the selected segment corresponds to a destination beat of a subsequent segment, wherein the initial segment is an initial selected segment, and whereby the new musical work is thus composed from beginning to end and the new musical work terminates on a cadence segment.
13. The method according to claim 12 wherein the step of sequentially selecting occurs automatically.
14. The method according to claim 12 wherein the step of sequentially selecting selects at least 8 musical work segments.
PCT/US2007/088212 2006-12-19 2007-12-19 Recombinant music coposition algorithm and method of using the same WO2008077128A2 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US11/613,097 US7696426B2 (en) 2006-12-19 2006-12-19 Recombinant music composition algorithm and method of using the same
US11/613,097 2006-12-19

Publications (3)

Publication Number Publication Date
WO2008077128A2 true WO2008077128A2 (en) 2008-06-26
WO2008077128A3 WO2008077128A3 (en) 2008-09-12
WO2008077128A9 WO2008077128A9 (en) 2008-10-23

Family

ID=39525574

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2007/088212 WO2008077128A2 (en) 2006-12-19 2007-12-19 Recombinant music coposition algorithm and method of using the same

Country Status (2)

Country Link
US (1) US7696426B2 (en)
WO (1) WO2008077128A2 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9620092B2 (en) 2012-12-21 2017-04-11 The Hong Kong University Of Science And Technology Composition using correlation between melody and lyrics

Families Citing this family (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9818386B2 (en) * 1999-10-19 2017-11-14 Medialab Solutions Corp. Interactive digital music recorder and player
US8487176B1 (en) 2001-11-06 2013-07-16 James W. Wieder Music and sound that varies from one playback to another playback
US20090078108A1 (en) * 2007-09-20 2009-03-26 Rick Rowe Musical composition system and method
JP5228432B2 (en) * 2007-10-10 2013-07-03 ヤマハ株式会社 Segment search apparatus and program
US8890869B2 (en) * 2008-08-12 2014-11-18 Adobe Systems Incorporated Colorization of audio segments
US9153217B2 (en) * 2010-11-01 2015-10-06 James W. Wieder Simultaneously playing sound-segments to find and act-upon a composition
US8484229B2 (en) * 2011-05-11 2013-07-09 King Abdulaziz City For Science And Technology Method and system for identifying traditional arabic poems
US20130125732A1 (en) * 2011-11-21 2013-05-23 Paul Nho Nguyen Methods to Create New Melodies and Music From Existing Source
US9721551B2 (en) * 2015-09-29 2017-08-01 Amper Music, Inc. Machines, systems, processes for automated music composition and generation employing linguistic and/or graphical icon based musical experience descriptions
US10854180B2 (en) 2015-09-29 2020-12-01 Amper Music, Inc. Method of and system for controlling the qualities of musical energy embodied in and expressed by digital music to be automatically composed and generated by an automated music composition and generation engine
KR101942814B1 (en) * 2017-08-10 2019-01-29 주식회사 쿨잼컴퍼니 Method for providing accompaniment based on user humming melody and apparatus for the same
US11328700B2 (en) 2018-11-15 2022-05-10 Sony Interactive Entertainment LLC Dynamic music modification
US11037538B2 (en) 2019-10-15 2021-06-15 Shutterstock, Inc. Method of and system for automated musical arrangement and musical instrument performance style transformation supported within an automated music performance system
US10964299B1 (en) 2019-10-15 2021-03-30 Shutterstock, Inc. Method of and system for automatically generating digital performances of music compositions using notes selected from virtual musical instruments based on the music-theoretic states of the music compositions
US11024275B2 (en) 2019-10-15 2021-06-01 Shutterstock, Inc. Method of digitally performing a music composition using virtual musical instruments having performance logic executing within a virtual musical instrument (VMI) library management system
US11615772B2 (en) * 2020-01-31 2023-03-28 Obeebo Labs Ltd. Systems, devices, and methods for musical catalog amplification services
WO2021202868A1 (en) * 2020-04-02 2021-10-07 Sony Interactive Entertainment LLC Dynamic music modification
US11183160B1 (en) * 2021-02-16 2021-11-23 Wonder Inventions, Llc Musical composition file generation and management system
US11790876B1 (en) * 2022-06-01 2023-10-17 Library X Music Inc. Music technique responsible for versioning

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5099740A (en) * 1987-04-08 1992-03-31 Casio Computer Co., Ltd. Automatic composer for forming rhythm patterns and entire musical pieces
US20010025561A1 (en) * 1998-02-19 2001-10-04 Milburn Andy M. Method and apparatus for composing original works
US20020033090A1 (en) * 2000-09-20 2002-03-21 Yamaha Corporation System and method for assisting in composing music by means of musical template data
US6740802B1 (en) * 2000-09-06 2004-05-25 Bernard H. Browne, Jr. Instant musician, recording artist and composer

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4982643A (en) * 1987-12-24 1991-01-08 Casio Computer Co., Ltd. Automatic composer
US6835886B2 (en) * 2001-11-19 2004-12-28 Yamaha Corporation Tone synthesis apparatus and method for synthesizing an envelope on the basis of a segment template
US6897368B2 (en) * 2002-11-12 2005-05-24 Alain Georges Systems and methods for creating, modifying, interacting with and playing musical compositions
JP4107107B2 (en) * 2003-02-28 2008-06-25 ヤマハ株式会社 Keyboard instrument

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5099740A (en) * 1987-04-08 1992-03-31 Casio Computer Co., Ltd. Automatic composer for forming rhythm patterns and entire musical pieces
US20010025561A1 (en) * 1998-02-19 2001-10-04 Milburn Andy M. Method and apparatus for composing original works
US6740802B1 (en) * 2000-09-06 2004-05-25 Bernard H. Browne, Jr. Instant musician, recording artist and composer
US20020033090A1 (en) * 2000-09-20 2002-03-21 Yamaha Corporation System and method for assisting in composing music by means of musical template data

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9620092B2 (en) 2012-12-21 2017-04-11 The Hong Kong University Of Science And Technology Composition using correlation between melody and lyrics

Also Published As

Publication number Publication date
WO2008077128A9 (en) 2008-10-23
US7696426B2 (en) 2010-04-13
US20080141850A1 (en) 2008-06-19
WO2008077128A3 (en) 2008-09-12

Similar Documents

Publication Publication Date Title
US7696426B2 (en) Recombinant music composition algorithm and method of using the same
US6225546B1 (en) Method and apparatus for music summarization and creation of audio summaries
US5883326A (en) Music composition
US6297439B1 (en) System and method for automatic music generation using a neural network architecture
Ponsford et al. Statistical learning of harmonic movement
Dubnov et al. Using machine-learning methods for musical style modeling
US20120144978A1 (en) System and Method For Adaptive Melodic Segmentation and Motivic Identification
CN112382257B (en) Audio processing method, device, equipment and medium
Lindemann Music synthesis with reconstructive phrase modeling
Langston Six techniques for algorithmic music composition
Granroth-Wilding et al. Harmonic analysis of music using combinatory categorial grammar
Cope Recombinant music: using the computer to explore musical style
Hirata et al. Interactive music summarization based on generative theory of tonal music
Dannenberg Style in music
CN113010730A (en) Music file generation method, device, equipment and storage medium
US6313390B1 (en) Method for automatically controlling electronic musical devices by means of real-time construction and search of a multi-level data structure
Hirata et al. Interactive Music Summarization based on GTTM.
de Mantaras Making music with AI: Some examples
Trochidis et al. CAMeL: Carnatic percussion music generation using n-gram models
Dubnov et al. Deep and shallow: Machine learning in music and audio
Drewes et al. An algebra for tree-based music generation
Winter Interactive music: Compositional techniques for communicating different emotional qualities
Hoshi et al. Versatile Automatic Piano Reduction Generation System by Deep Learning
Eigenfeldt The human fingerprint in machine generated music
Amagasu et al. Tokenization of MIDI Sequences for Transcription

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 07871716

Country of ref document: EP

Kind code of ref document: A2

DPE1 Request for preliminary examination filed after expiration of 19th month from priority date (pct application filed from 20040101)
NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 07871716

Country of ref document: EP

Kind code of ref document: A2

DPE1 Request for preliminary examination filed after expiration of 19th month from priority date (pct application filed from 20040101)