WO2002019164A1 - Virtual aggregate fields - Google Patents

Virtual aggregate fields Download PDF

Info

Publication number
WO2002019164A1
WO2002019164A1 PCT/US2001/011418 US0111418W WO0219164A1 WO 2002019164 A1 WO2002019164 A1 WO 2002019164A1 US 0111418 W US0111418 W US 0111418W WO 0219164 A1 WO0219164 A1 WO 0219164A1
Authority
WO
WIPO (PCT)
Prior art keywords
document
fields
data
aggregate
virtual
Prior art date
Application number
PCT/US2001/011418
Other languages
French (fr)
Inventor
Walter Lindsay
Original Assignee
Contivo, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Contivo, Inc. filed Critical Contivo, Inc.
Priority to AU2001251451A priority Critical patent/AU2001251451A1/en
Priority to CA002420788A priority patent/CA2420788A1/en
Priority to EP01924833A priority patent/EP1330735A1/en
Priority to JP2002523205A priority patent/JP2004507840A/en
Publication of WO2002019164A1 publication Critical patent/WO2002019164A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25Integrating or interfacing systems involving database management systems
    • G06F16/258Data format conversion from or to a database
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99931Database or file accessing
    • Y10S707/99933Query processing, i.e. searching
    • Y10S707/99934Query formulation, input preparation, or translation
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99951File or database maintenance
    • Y10S707/99952Coherency, e.g. same view to multiple users
    • Y10S707/99954Version management

Definitions

  • the invention is related to the field of representation and translation of electronic documents.
  • Location is the DUNS_Number with the Store_Number concatenated to it.
  • a "meaning" is part of multiple fields in Documentl.
  • mapping Documentl mapping Documentl to Document2.
  • Customerl writes code like:
  • mappings in a document can have more than one meaning. This means that mapping is hard to automate. Instead, the mappings must be manually done and require customized code, which does not allow reuse of mapping knowledge and rules.
  • mapping and the mapping rules are one-off. That is, each time a user wants to define how to perform a document translation, similar code must be written and tested. This increases the time needed to define how to translate from the source to the target document.
  • mapping and the mapping rules depend on user- written code. This makes it hard to automatically validate the integrity of the mapping. It also sets a minimum bar for the skill level of anyone trying to define a mapping, as they must then know all the document locations that might hold a particular meaning, and must be skilful enough to write the code to handle the case. This imposes a maintenance burden, as fixing a problem in a mapping requires altering code.
  • mapping and the mapping rules are translation-language dependent.
  • the code that must be written and tested depends on the underlying translation engine that will translate the documents.
  • mapping rules will be translation-engine dependent, and that a translation defined for one translation engine will likely need adjusting to make the mapping work on a different translation engine. Moving a transform from one translation engine to another is difficult.
  • the source and target mappings must be significantly different.
  • the code for handling the case described above will differ whether the document is the source or the target document. If one has mapped from A to B, mapping from B to A requires major rework, as the code for the mapping would have to be rewritten using different logic.
  • a method comprising defining one or more aggregate virtual fields for a first document using meta-data associated with the first document is disclosed.
  • Figure 1 illustrates one embodiment of a data structure for part of a document.
  • Figure 2 illustrates another embodiment of a data structure for the same document as Figure 1.
  • Figure 3 illustrates one embodiment of a data structure including the information needed to create the virtual aggregate field of Figure 2.
  • Figure 4 is an example of a network that uses aggregate virtual fields to translate documents.
  • Figure 5 is an example of a computer system that uses aggregate virtual fields to translate documents.
  • Figure 6 is an example of a translation system that uses aggregate virtual fields to translate documents.
  • Figure 7 is an embodiment of a method for automatically generating a transform using aggregate virtual fields.
  • a transform to perform mappings can be automatically generated using virtual aggregate fields.
  • the virtual aggregate fields can be used to create a named field in the document, and associate it with a set of fields with a "map- to" and/or a "map-from” rule. All mapping and mapping rule operations can then be applied on the new field.
  • the transform to perform the mappings can then be automatically generated, as discussed below.
  • Figure 1 depicts a data structure for part of a document. Groupl has other groups under it, GroupA, GroupB and GroupC. GroupB has other groups under it, including Groupl, which has field Fieldl.3 in it. The full path through this document, from the top as shown down to Fieldl.3, is described as:
  • Fieldl.3 and Field3.2 hold information that together represent a "meaning" of the document. It is important to realize that in many concrete documents a structure like GroupB can repeat many times.
  • Figure 1 is an example of one embodiment of meta-data.
  • GroupB can appear multiple times. This is a particular instance of GroupB. Each instance of GroupB can have different data in its fields.
  • Figure 2 depicts a data structure for the same document as Figure 1, except that field F_Meaningl defined. That is, if a mapping accesses field F_Meaningl in a source document, the transform will automatically use the MapFrom code. If a mapping accesses field F_Meaningl in a target document, the transform will automatically use the MapTo code.
  • F_Meaningl is an enabled virtual aggregate field. That is, it appears in the document. There are events, such as the user in the GUI, that trigger the virtual aggregate field to be enabled - inserted into the document. Similarly, a virtual aggregate field can be disabled and disappear from the document.
  • Figure 3 illustrates a data structure including the information needed to create the virtual aggregate field of Figure 2.
  • the information needed to generate a virtual group is:
  • ParticipatingFields the fields that hold the source data of the new field.
  • mapping rules can be applied to meta-data to map from a field under a virtual aggregate field in the source document to the relevant location(s) in the target document.
  • a virtual aggregate field in a source document can be treated just like any other field. Whatever operations - move, or any other mapping rule that might be applied to other fields - apply to such fields.
  • a transform is the code used by a translation engine to convert one concrete document to another.
  • a transform is generated by applying mapping rules to the meta-data of the source and target documents. After the mapping rules and the meta-data, including virtual aggregate fields, are defined, a transform can be automatically generated, which performs the following processing on fields in a virtual aggregate field defined for a concrete source document:
  • a virtual aggregate field in a target document can be treated just like any other field. Whatever operations - move, or any other manipulation rule that might be applied to other fields - apply to virtual aggregate fields.
  • a transform can be automatically generated, which will do this processing on virtual aggregate fields defined for a concrete target document:
  • a host computer system transmits and receives data over a computer network or standard telephone line.
  • the steps of accessing, downloading, and manipulating the data, as well as other aspects of the present invention are implemented by a central processing unit (CPU) in the host computer executing sequences of instructions stored in a memory.
  • the memory may be a random access memory (RAM), read-only memory (ROM), a persistent store, such as a mass storage device, or any combination of these devices.
  • Execution of the sequences of instructions causes the CPU to perform steps according to the present invention.
  • the instructions may be loaded into the memory of the host computer from a storage device, or from one or more other computer systems over a network connection.
  • a server computer may transmit a sequence of instructions to the host computer in response to a message transmitted to the server over a network by the host.
  • the host receives the instructions over the network connection, it stores the instructions in memory.
  • the host may store the instructions for later execution or execute the instructions as they arrive over the network connection.
  • the downloaded instructions may be directly supported by the CPU.
  • the instructions may not be directly executable by the CPU, and may instead be executed by an interpreter that interprets the instructions.
  • hardwired circuitry may be used in place of, or in combination with, software instructions to implement the present invention.
  • the present invention is not limited to any specific combination of hardware circuitry and software, nor to any particular source for the instructions executed by the host computer.
  • Figure 4 illustrates a system 400 in which a host computer 402 is connected to a remote computer 404 through a network 410.
  • the network interface between host computer 402 and remote 404 may also include one or more routers, such as routers 406 and 408, which serve to buffer and route the data transmitted between the host and client computers.
  • Network 410 may be the Internet, a Wide Area Network (WAN), a Local Area Network (LAN), or any combination thereof.
  • the remote computer 404 may be a World-Wide Web (WWW) server that stores data in the form of 'web pages' and transmits these pages as Hypertext Markup Language (HTML) files over the Internet network 410 to host computer 402.
  • WWW World-Wide Web
  • FIG. 5 is a block diagram of a representative networked computer, such as host computer 402 illustrated in Figure 4.
  • the computer system 500 includes a processor 502 coupled through a bus 501 to a random access memory (RAM) 504, a read only memory (ROM) 506, and a mass storage device 507.
  • Mass storage device 507 could be a disk or tape drive for storing data and instructions.
  • a display device 520 for providing visual output is also coupled to processor 502 through bus 501.
  • Keyboard 521 is coupled to bus 501 for communicating information and command selections to processor 502.
  • cursor control unit 522 is Another type of user input device, which may be a device such as a mouse or trackball, for communicating direction commands that control cursor movement on display 520.
  • audio output port 524 is also coupled to processor 502 through bus 501 for connection to speakers that output audio signals produced by computer 500.
  • I/O input/ output
  • network interface device 523 for providing a physical and logical connection between computer system 500 and a network.
  • Network interface device 523 is used by various communication applications running on computer 500 for communicating over a network medium and may represent devices such as an ethernet card, ISDN card, or similar devices.
  • Modem 526 interfaces computer system 500 to a telephone line and translates digital data produced by the computer into analog signals that can be transmitted over standard telephone lines, such as by telephone system 412 in Figure 4.
  • modem 526 provides a hardwired interface to a telephone wall jack, however modem 526 could also represent a wireless modem for communication over cellular telephone networks.
  • Figure 4 the architecture of Figure 4 is provided only for purposes of illustration, and that a host computer used in conjunction with the present invention is not limited to the specific architecture shown.
  • Figure 6 shows an example of the groups and fields of two different documents, a source document format 610 and a target document format 620.
  • the document is a purchase order.
  • the document may convey any information that one person or business wants to send to another person or business.
  • the source group 615 includes the source fields of name, address, city, description, price, quantity, and total.
  • the target group 625 includes the fields name, location, information, cost, number, and amount.
  • the formats of the fields in the source and target groups are structurally different, they have similarities and common abstractions such as name, amount, and place to ship the goods.
  • the names of the fields in groups 615 and 625 may be different, such as "price" and "cost,” for example, but the data 617 and 627 contained in these fields is the same.
  • a virtual field that corresponds to a field in the source and target groups 615 and 625 can be used to capture these common abstractions using meta-data.
  • meta-data associated with the source document can be used by the mapping engine to define one or more aggregate virtual fields.
  • the meta-data used to define the aggregate virtual fields can be obtained from a data structure such as the data structure of Figure 3.
  • the mapping engine can apply mapping rules to the meta-data associated with the source group, including the aggregate virtual fields, to automatically generate a transform.
  • the transform is then provided to the translation engine, which uses the transform to convert the source document into the target document.
  • a mapping engine 650 creates a translation map, as shown in Figure 6.
  • the translation map is used by a translation engine 630 to convert, or translate a message from a source format to a target format.
  • the translation map is a metadata level description of the fields in the source document that will be used to populate a field in the target document.
  • Figure 7 shows an embodiment of a method for automatically generating a transform using aggregate virtual fields.
  • One or more aggregate virtual fields for a first document are defined, step 710.
  • the aggregate virtual fields are defined using meta-data contained in the data structure of Figure 3.
  • One or more of these aggregate virtual fields are enabled, so that the enabled aggregate virtual fields appear in the first document, step 720.
  • One or more of the aggregate virtual fields may be disabled, so that the disabled aggregate virtual fields do not appear in the first document, step 720.
  • Mapping rules to map data from fields in the first document to fields in a second document are defined, step 740. Then, a transform to convert the first document into the second document is automatically generated by applying the mapping rules to the meta-data, including the enabled aggregate virtual fields, of the first and second documents.
  • Virtual aggregate fields thus provide several advantages.
  • the virtual aggregate fields allow code to automap between source and target documents.
  • the automapping code enables aggregate virtual fields as needed - if it discovers that a field under a aggregate virtual field that could potentially be enabled is required by the automapping mechanism, it enables the aggregate virtual field.
  • mapping tends to be sufficient if a aggregate virtual field is involved.
  • the user does not need to write code to identify when the data of the fields of a aggregate virtual field have a particular meaning, or write code to convert between the aggregate virtual field and its constituent fields.
  • mapping from document A to B is much closer to mapping from B to A than without this invention.
  • the mapping from B to A has been made closer to the transposition of the mapping from A to B.
  • Mapping one direction then provides most of the information needed to map the other direction. If users had to write code to map from A to B, such a transposition would be far more work. With this invention, transposing a mapping is far less work.
  • mapping to or from fields under a aggregate virtual field is translation-engine independent (assuming the code of the MapTo and MapFrom functions can support the target translation engines). We merely generate the code appropriate for that translation engine when writing out the transform in the way that translation engine requires.
  • mapping difference checking is easier. This way, once the MapTo and MapFrom functions have been validated, mapping difference checking is simplified.
  • mapping maps are more translation-engine independent, as less overall code is needed.
  • Tenth creating a map is faster, as automapping has a better hit rate.
  • maps have fewer bugs, as users don't need to write code. Thus, debugging a mapping is faster.
  • Twelfth time to market is faster for users. Thirteenth, this mechanism works with virtual fields and groups.

Abstract

A method comprising defining one or more aggregate virtual fields (Figure 6) for a first document (610) using meta-data associated with the first document (615) is disclosed.

Description

VIRTUAL AGGREGATE FIELDS
FIELD OF INVENTION
The invention is related to the field of representation and translation of electronic documents.
BACKGROUND OF THE INVENTION
Some documents have fields that must be combined or massaged to produce data that matches a desired "meaning". For example, in a document, the following structure may be present in Documentl:
Store_Identifier (group)
DUNS_Number (9-digit field) Store_Number (4-digit field)
And in Document2: Location (11-digit field)
Where the Location is the DUNS_Number with the Store_Number concatenated to it. In this case, a "meaning" is part of multiple fields in Documentl.
Conventional solutions to mapping require users to write customized code to handle such cases. For example, suppose Vendorl has a mapping tool that Customerl uses. Customerl is mapping Documentl to Document2. Customerl writes code like:
Tar get. Location = concati Source.Store_Identifier.DUNS_Number, Source. Store_Identifier. Store Number)
Locations in a document can have more than one meaning. This means that mapping is hard to automate. Instead, the mappings must be manually done and require customized code, which does not allow reuse of mapping knowledge and rules.
Conventional solutions have several disadvantages. First, both mapping and the mapping rules are one-off. That is, each time a user wants to define how to perform a document translation, similar code must be written and tested. This increases the time needed to define how to translate from the source to the target document.
Furthermore, both mapping and the mapping rules depend on user- written code. This makes it hard to automatically validate the integrity of the mapping. It also sets a minimum bar for the skill level of anyone trying to define a mapping, as they must then know all the document locations that might hold a particular meaning, and must be skilful enough to write the code to handle the case. This imposes a maintenance burden, as fixing a problem in a mapping requires altering code.
The mapping and the mapping rules are translation-language dependent. The code that must be written and tested depends on the underlying translation engine that will translate the documents. Thus, mapping rules will be translation-engine dependent, and that a translation defined for one translation engine will likely need adjusting to make the mapping work on a different translation engine. Moving a transform from one translation engine to another is difficult.
The source and target mappings must be significantly different. The code for handling the case described above will differ whether the document is the source or the target document. If one has mapped from A to B, mapping from B to A requires major rework, as the code for the mapping would have to be rewritten using different logic.
Conventional mapping tools previously use superficial similarities in field names or document structure as the basis for automapping. They can not automap to virtual structures, forcing users to write code. SUMMARY OF THE INVENTION
A method comprising defining one or more aggregate virtual fields for a first document using meta-data associated with the first document is disclosed.
BRIEF DESCRIPTION OF THE DRAWINGS
The present invention is illustrated by way of example and not limitation in the figures of the accompanying drawings, in which like references indicate similar elements, and in which:
Figure 1 illustrates one embodiment of a data structure for part of a document.
Figure 2 illustrates another embodiment of a data structure for the same document as Figure 1.
Figure 3 illustrates one embodiment of a data structure including the information needed to create the virtual aggregate field of Figure 2.
Figure 4 is an example of a network that uses aggregate virtual fields to translate documents.
Figure 5 is an example of a computer system that uses aggregate virtual fields to translate documents.
Figure 6 is an example of a translation system that uses aggregate virtual fields to translate documents.
Figure 7 is an embodiment of a method for automatically generating a transform using aggregate virtual fields.
DETAILED DESCRIPTION
A transform to perform mappings can be automatically generated using virtual aggregate fields. The virtual aggregate fields can be used to create a named field in the document, and associate it with a set of fields with a "map- to" and/or a "map-from" rule. All mapping and mapping rule operations can then be applied on the new field. The transform to perform the mappings can then be automatically generated, as discussed below. Figure 1 depicts a data structure for part of a document. Groupl has other groups under it, GroupA, GroupB and GroupC. GroupB has other groups under it, including Groupl, which has field Fieldl.3 in it. The full path through this document, from the top as shown down to Fieldl.3, is described as:
Groupl/GroupB/Groupl/Fieldl .3
Fieldl.3 and Field3.2 hold information that together represent a "meaning" of the document. It is important to realize that in many concrete documents a structure like GroupB can repeat many times. Figure 1 is an example of one embodiment of meta-data. In a concrete document, GroupB can appear multiple times. This is a particular instance of GroupB. Each instance of GroupB can have different data in its fields.
Enabling and Disabling a Virtual Aggregate Field
Figure 2 depicts a data structure for the same document as Figure 1, except that field F_Meaningl defined. That is, if a mapping accesses field F_Meaningl in a source document, the transform will automatically use the MapFrom code. If a mapping accesses field F_Meaningl in a target document, the transform will automatically use the MapTo code.
F_Meaningl is an enabled virtual aggregate field. That is, it appears in the document. There are events, such as the user in the GUI, that trigger the virtual aggregate field to be enabled - inserted into the document. Similarly, a virtual aggregate field can be disabled and disappear from the document.
Source Data for Defining Virtual Aggregate Fields
The descriptions of virtual aggregate fields can be stored in an external data source. Figure 3 illustrates a data structure including the information needed to create the virtual aggregate field of Figure 2. The information needed to generate a virtual group is:
• Name - the name of the new field.
• ParentGroup - the location of the new field. • Description - a textual description of the general meaning of the new field.
• ParticipatingFields - the fields that hold the source data of the new field.
• MapTo - code that, given the value of the virtual field, populates the ParticipatingFields.
• MapFrom - code that, given the ParticipatingFields, produces the value of the virtual aggregate field.
Using Virtual Groups in Source Documents to Automatically Generate a Transform
Users can apply mapping rules to meta-data to map from a field under a virtual aggregate field in the source document to the relevant location(s) in the target document. A virtual aggregate field in a source document can be treated just like any other field. Whatever operations - move, or any other mapping rule that might be applied to other fields - apply to such fields.
A transform is the code used by a translation engine to convert one concrete document to another. A transform is generated by applying mapping rules to the meta-data of the source and target documents. After the mapping rules and the meta-data, including virtual aggregate fields, are defined, a transform can be automatically generated, which performs the following processing on fields in a virtual aggregate field defined for a concrete source document:
Apply the MapTo function, and return its value.
Note that simple extension to support default values, etc. are within the scope of the invention. For example, a default value for a participating field in a virtual aggregate field could be added to the information in Figure 3. Looping functionality could also be supported to create a virtual aggregate field that combines the information of multiple structures into a single value. Using Virtual Groups in Target Documents to Automatically Generate a Transform
Users can map from location(s) in the source document to a virtual aggregate field in the target document. A virtual aggregate field in a target document can be treated just like any other field. Whatever operations - move, or any other manipulation rule that might be applied to other fields - apply to virtual aggregate fields.
After a transformation is defined, a transform can be automatically generated, which will do this processing on virtual aggregate fields defined for a concrete target document:
Apply the MapFrom function on the data mapped to the virtual aggregate field, in order to populate the participating fields.
Note that simple extension to support default values, etc. are within the scope of the invention. For example, a default value for a participating field of a virtual aggregate field could be added to the information of Figure 3, such that the participating field is conditionally populated. Users can write the code of the MapTo and MapFrom functions.
Hardware Overview
According to the present invention, a host computer system transmits and receives data over a computer network or standard telephone line. According to one embodiment, the steps of accessing, downloading, and manipulating the data, as well as other aspects of the present invention are implemented by a central processing unit (CPU) in the host computer executing sequences of instructions stored in a memory. The memory may be a random access memory (RAM), read-only memory (ROM), a persistent store, such as a mass storage device, or any combination of these devices. Execution of the sequences of instructions causes the CPU to perform steps according to the present invention. The instructions may be loaded into the memory of the host computer from a storage device, or from one or more other computer systems over a network connection. For example, a server computer may transmit a sequence of instructions to the host computer in response to a message transmitted to the server over a network by the host. As the host receives the instructions over the network connection, it stores the instructions in memory. The host may store the instructions for later execution or execute the instructions as they arrive over the network connection. In some cases, the downloaded instructions may be directly supported by the CPU. In other cases, the instructions may not be directly executable by the CPU, and may instead be executed by an interpreter that interprets the instructions. In other embodiments, hardwired circuitry may be used in place of, or in combination with, software instructions to implement the present invention. Thus, the present invention is not limited to any specific combination of hardware circuitry and software, nor to any particular source for the instructions executed by the host computer.
Figure 4 illustrates a system 400 in which a host computer 402 is connected to a remote computer 404 through a network 410. The network interface between host computer 402 and remote 404 may also include one or more routers, such as routers 406 and 408, which serve to buffer and route the data transmitted between the host and client computers. Network 410 may be the Internet, a Wide Area Network (WAN), a Local Area Network (LAN), or any combination thereof. The remote computer 404 may be a World-Wide Web (WWW) server that stores data in the form of 'web pages' and transmits these pages as Hypertext Markup Language (HTML) files over the Internet network 410 to host computer 402. To access these files, host computer 402 runs a 'web browser', which is simply an application program for accessing and providing links to web pages available on various Internet sites. Host computer 402 is also configured to communicate to telephone system 412 through a telephone interface, typically a modem. Figure 5 is a block diagram of a representative networked computer, such as host computer 402 illustrated in Figure 4. The computer system 500 includes a processor 502 coupled through a bus 501 to a random access memory (RAM) 504, a read only memory (ROM) 506, and a mass storage device 507. Mass storage device 507 could be a disk or tape drive for storing data and instructions. A display device 520 for providing visual output is also coupled to processor 502 through bus 501. Keyboard 521 is coupled to bus 501 for communicating information and command selections to processor 502. Another type of user input device is cursor control unit 522, which may be a device such as a mouse or trackball, for communicating direction commands that control cursor movement on display 520. Also coupled to processor 502 through bus 501 is an audio output port 524 for connection to speakers that output audio signals produced by computer 500.
Further coupled to processor 502 through bus 501 is an input/ output (I/O) interface 525, and a network interface device 523 for providing a physical and logical connection between computer system 500 and a network. Network interface device 523 is used by various communication applications running on computer 500 for communicating over a network medium and may represent devices such as an ethernet card, ISDN card, or similar devices.
Modem 526 interfaces computer system 500 to a telephone line and translates digital data produced by the computer into analog signals that can be transmitted over standard telephone lines, such as by telephone system 412 in Figure 4. In an embodiment of the present invention, modem 526 provides a hardwired interface to a telephone wall jack, however modem 526 could also represent a wireless modem for communication over cellular telephone networks. It should be noted that the architecture of Figure 4 is provided only for purposes of illustration, and that a host computer used in conjunction with the present invention is not limited to the specific architecture shown.
Figure 6 shows an example of the groups and fields of two different documents, a source document format 610 and a target document format 620. In this embodiment, the document is a purchase order. However, the document may convey any information that one person or business wants to send to another person or business. The source group 615 includes the source fields of name, address, city, description, price, quantity, and total. The target group 625 includes the fields name, location, information, cost, number, and amount. Although the formats of the fields in the source and target groups are structurally different, they have similarities and common abstractions such as name, amount, and place to ship the goods. Thus, the names of the fields in groups 615 and 625 may be different, such as "price" and "cost," for example, but the data 617 and 627 contained in these fields is the same.
A virtual field that corresponds to a field in the source and target groups 615 and 625 can be used to capture these common abstractions using meta-data. For example, meta-data associated with the source document can be used by the mapping engine to define one or more aggregate virtual fields. The meta-data used to define the aggregate virtual fields can be obtained from a data structure such as the data structure of Figure 3. After the aggregate virtual fields are denned, the mapping engine can apply mapping rules to the meta-data associated with the source group, including the aggregate virtual fields, to automatically generate a transform. The transform is then provided to the translation engine, which uses the transform to convert the source document into the target document.
A mapping engine 650 creates a translation map, as shown in Figure 6. The translation map is used by a translation engine 630 to convert, or translate a message from a source format to a target format. The translation map is a metadata level description of the fields in the source document that will be used to populate a field in the target document.
Figure 7 shows an embodiment of a method for automatically generating a transform using aggregate virtual fields. One or more aggregate virtual fields for a first document are defined, step 710. The aggregate virtual fields are defined using meta-data contained in the data structure of Figure 3. One or more of these aggregate virtual fields are enabled, so that the enabled aggregate virtual fields appear in the first document, step 720. One or more of the aggregate virtual fields may be disabled, so that the disabled aggregate virtual fields do not appear in the first document, step 720. Mapping rules to map data from fields in the first document to fields in a second document are defined, step 740. Then, a transform to convert the first document into the second document is automatically generated by applying the mapping rules to the meta-data, including the enabled aggregate virtual fields, of the first and second documents.
Virtual aggregate fields thus provide several advantages. First, the virtual aggregate fields allow code to automap between source and target documents. The automapping code enables aggregate virtual fields as needed - if it discovers that a field under a aggregate virtual field that could potentially be enabled is required by the automapping mechanism, it enables the aggregate virtual field.
Second, merely doing the mapping tends to be sufficient if a aggregate virtual field is involved. Once the virtual aggregate field is defined, the user does not need to write code to identify when the data of the fields of a aggregate virtual field have a particular meaning, or write code to convert between the aggregate virtual field and its constituent fields.
Third, the ability to write code is not compromised. This new technique can co-exist with the older way of doing things. Fourth, transformation instructions can be successfully generated for the translation engines.
Fifth, mapping from document A to B is much closer to mapping from B to A than without this invention. Thus, the mapping from B to A has been made closer to the transposition of the mapping from A to B. Mapping one direction then provides most of the information needed to map the other direction. If users had to write code to map from A to B, such a transposition would be far more work. With this invention, transposing a mapping is far less work. Sixth, mapping to or from fields under a aggregate virtual field is translation-engine independent (assuming the code of the MapTo and MapFrom functions can support the target translation engines). We merely generate the code appropriate for that translation engine when writing out the transform in the way that translation engine requires.
Seventh, without needing to perform complicated analyses of user- written code, we can validate mappings to and from fields under aggregate virtual fields, as most cases do not require the user to write code. Because fewer mappings require the user to write code, mapping difference checking is easier. This way, once the MapTo and MapFrom functions have been validated, mapping difference checking is simplified.
Eighth, a non-programmer can do most of the work of mapping. Ninth, maps are more translation-engine independent, as less overall code is needed. Tenth, creating a map is faster, as automapping has a better hit rate. Eleventh, maps have fewer bugs, as users don't need to write code. Thus, debugging a mapping is faster. Twelfth, time to market is faster for users. Thirteenth, this mechanism works with virtual fields and groups.
Fourteenth, nesting of virtual groups works. That is, the meaning of a structure in a concrete document can depend on several qualifier fields as discussed above.
These and other embodiments of the present invention may be realized in accordance with these teachings and it should be evident that various modifications and changes may be made in these teachings without departing from the broader spirit and scope of the invention. The specification and drawings are, accordingly, to be regarded in an illustrative rather than restrictive sense and the invention measured only in terms of the claims.

Claims

CLAIMSWhat is claimed is:
1. A method comprising: defining one or more aggregate virtual fields for a first document using metadata associated with the first document.
2. The method of claim 1 further comprising enabling one or more aggregate virtual fields so that the enabled aggregate virtual fields appear in the first document.
3. The method of claim 2 further comprising defining mapping rules to map data from fields in the first document to fields in a second document.
4. The method of claim 3 further comprising automatically generating a transform to convert the first document into the second document by applying the mapping rules to the meta-data and the enabled virtual aggregate fields.
5. The method of claim 1, wherein each aggregate virtual field is defined using meta-data contained in a data structure, said data structure having a name data element, a parent group data element, a participating fields data element, a map-to data element, and a map-from data element.
6. A computer readable medium having instructions which, when executed by a processing system, cause the system to: define one or more aggregate virtual fields for a first document using meta-data associated with the first document.
7. The medium of claim 6 wherein the executed instructions further cause the system to enable one or more aggregate virtual fields so that the enabled aggregate virtual fields appear in the first document.
8. The medium of claim 7 wherein the executed instructions further cause the system to define mapping rules to map data from fields in the first document to fields in a second document.
9. The medium of claim 8 wherein the executed instructions further cause the system to automatically generate a transform to convert the first document into the second document by applying the mapping rules to the meta-data and the enabled virtual aggregate fields.
10. The medium of claim 6, wherein each aggregate virtual field is defined using meta-data contained in a data structure, said data structure having a name data element, a parent group data element, a participating fields data element, a map-to data element, and a map-from data element.
11. An apparatus comprising: means for associating meta-data with a first document; and means for defining one or more aggregate virtual fields for the first document using meta-data associated with the first document.
12. The apparatus of claim 11 further comprising means for enabling one or more aggregate virtual fields so that the enabled aggregate virtual fields appear in the first document.
13. The apparatus of claim 12 further comprising means for defining mapping rules to map data from fields in the first document to fields in a second document.
14. The apparatus of claim 13 further comprising means for automatically generating a transform to convert the first document into the second document by applying the mapping rules to the meta-data and the enabled virtual aggregate fields.
15. The apparatus of claim 11, wherein each aggregate virtual field is defined means for using meta-data contained in a data structure, said data structure having a name data element, a parent group data element, a participating fields data element, a map-to data element, and a map-from data element.
PCT/US2001/011418 2000-08-29 2001-04-06 Virtual aggregate fields WO2002019164A1 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
AU2001251451A AU2001251451A1 (en) 2000-08-29 2001-04-06 Virtual aggregate fields
CA002420788A CA2420788A1 (en) 2000-08-29 2001-04-06 Virtual aggregate fields
EP01924833A EP1330735A1 (en) 2000-08-29 2001-04-06 Virtual aggregate fields
JP2002523205A JP2004507840A (en) 2000-08-29 2001-04-06 Virtual aggregate field

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US09/650,219 US6694338B1 (en) 2000-08-29 2000-08-29 Virtual aggregate fields
US09/650,219 2000-08-29

Publications (1)

Publication Number Publication Date
WO2002019164A1 true WO2002019164A1 (en) 2002-03-07

Family

ID=24607987

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2001/011418 WO2002019164A1 (en) 2000-08-29 2001-04-06 Virtual aggregate fields

Country Status (6)

Country Link
US (1) US6694338B1 (en)
EP (1) EP1330735A1 (en)
JP (1) JP2004507840A (en)
AU (1) AU2001251451A1 (en)
CA (1) CA2420788A1 (en)
WO (1) WO2002019164A1 (en)

Families Citing this family (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7463732B1 (en) * 2000-05-05 2008-12-09 3Com Corporation Flexible data outlet
US8788931B1 (en) 2000-11-28 2014-07-22 International Business Machines Corporation Creating mapping rules from meta data for data transformation utilizing visual editing
US20020143809A1 (en) * 2001-03-28 2002-10-03 Bennett Paul W. System and method for calculation using multi-field columns with hidden fields
JP2003150586A (en) * 2001-11-12 2003-05-23 Ntt Docomo Inc Document converting system, document converting method and computer-readable recording medium with document converting program recorded thereon
US6917953B2 (en) * 2001-12-17 2005-07-12 International Business Machines Corporation System and method for verifying database security across multiple platforms
US7139766B2 (en) * 2001-12-17 2006-11-21 Business Objects, S.A. Universal drill-down system for coordinated presentation of items in different databases
US7720730B2 (en) * 2001-12-18 2010-05-18 Siebel Systems, Inc. Method and apparatus for capturing consumer loan application data
US7284005B1 (en) * 2001-12-18 2007-10-16 Siebel Systems, Inc. Data transfer method and system
US8160955B2 (en) * 2001-12-18 2012-04-17 Siebel Systems, Inc. Method and apparatus for capturing commercial loan application data and assigning a commercial loan request
US7818342B2 (en) * 2004-11-12 2010-10-19 Sap Ag Tracking usage of data elements in electronic business communications
US7711676B2 (en) * 2004-11-12 2010-05-04 Sap Aktiengesellschaft Tracking usage of data elements in electronic business communications
US7865519B2 (en) * 2004-11-17 2011-01-04 Sap Aktiengesellschaft Using a controlled vocabulary library to generate business data component names
US7624384B2 (en) * 2004-11-30 2009-11-24 Intel Corporation Apparatus, system, and method of dynamic binary translation with translation reuse
US7840610B2 (en) * 2005-05-11 2010-11-23 International Business Machines Corporation Apparatus, system, and method for map definition generation
EP1760646A3 (en) * 2005-08-30 2007-04-18 Sap Ag System and method for managing and regulating object allocations
US7716255B2 (en) * 2005-11-30 2010-05-11 Sap Ag Modeling a data element
US8554538B2 (en) * 2006-04-07 2013-10-08 Sap Ag Generating a unique name for a data element
US7856597B2 (en) * 2006-06-01 2010-12-21 Sap Ag Adding tag name to collection
US8078568B2 (en) * 2007-06-25 2011-12-13 Sap Ag Properties of data elements
US8086646B2 (en) * 2007-07-20 2011-12-27 Sap Ag Scheme-based identifier
US8280755B2 (en) * 2007-10-30 2012-10-02 Sap Ag Context-specific modeling of collaborative business process
US8041746B2 (en) * 2007-10-30 2011-10-18 Sap Ag Mapping schemas using a naming rule
US9063800B2 (en) 2010-05-26 2015-06-23 Honeywell International Inc. Automated method for decoupling avionics application software in an IMA system
US20150294423A1 (en) * 2014-04-14 2015-10-15 Thomson Reuters (Markets) Llc Systems and methods for flexible field mapping

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6067552A (en) * 1995-08-21 2000-05-23 Cnet, Inc. User interface system and method for browsing a hypertext database

Family Cites Families (30)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5119465A (en) 1989-06-19 1992-06-02 Digital Equipment Corporation System for selectively converting plurality of source data structures through corresponding source intermediate structures, and target intermediate structures into selected target structure
GB9005697D0 (en) 1990-03-14 1990-05-09 Digital Equipment Int Data format conversion
US5369761A (en) 1990-03-30 1994-11-29 Conley; John D. Automatic and transparent denormalization support, wherein denormalization is achieved through appending of fields to base relations of a normalized database
US5272628A (en) * 1990-04-16 1993-12-21 Microsoft Corporation Method and system for aggregating tables having dissimilar formats
US5367619A (en) * 1990-04-27 1994-11-22 Eaton Corporation Electronic data entry system employing an expert system to facilitate generation of electronic data forms with complex interrelationships between fields and subforms
US5276793A (en) * 1990-05-14 1994-01-04 International Business Machines Corporation System and method for editing a structured document to preserve the intended appearance of document elements
EP0575358B1 (en) * 1991-03-12 1997-07-09 Wang Laboratories, Inc. Database management system graphical query front end
US5442778A (en) * 1991-11-12 1995-08-15 Xerox Corporation Scatter-gather: a cluster-based method and apparatus for browsing large document collections
JP2894676B2 (en) * 1994-03-21 1999-05-24 インターナショナル・ビジネス・マシーンズ・コーポレイション Asynchronous remote copy system and asynchronous remote copy method
US5584024A (en) * 1994-03-24 1996-12-10 Software Ag Interactive database query system and method for prohibiting the selection of semantically incorrect query parameters
US5627979A (en) 1994-07-18 1997-05-06 International Business Machines Corporation System and method for providing a graphical user interface for mapping and accessing objects in data stores
JPH0969101A (en) 1995-08-31 1997-03-11 Hitachi Ltd Method and device for generating structured document
US5724573A (en) 1995-12-22 1998-03-03 International Business Machines Corporation Method and system for mining quantitative association rules in large relational tables
US5960435A (en) * 1997-03-11 1999-09-28 Silicon Graphics, Inc. Method, system, and computer program product for computing histogram aggregations
CA2329345A1 (en) 1997-04-22 1998-10-29 Greg Hetherington Method and apparatus for processing free-format data
JPH10307816A (en) 1997-05-08 1998-11-17 Just Syst Corp Structured document processor its processing method and computer readable recording medium recording program for allowing computer to execute the method
US5946688A (en) * 1997-10-23 1999-08-31 Canon Kabuskiki Kaisha Extendable data storage system and method for storing relationships between UIDs
US6009429A (en) 1997-11-13 1999-12-28 International Business Machines Corporation HTML guided web tour
US6085196A (en) 1997-12-23 2000-07-04 Ricoh Company, Ltd. Object-oriented system and computer program product for mapping structured information to different structured information
IL122850A0 (en) 1998-01-05 1999-03-12 Wizsoft Pattern recognition using generalized association rules
US6128623A (en) * 1998-04-15 2000-10-03 Inktomi Corporation High performance object cache
US6128627A (en) * 1998-04-15 2000-10-03 Inktomi Corporation Consistent data storage in an object cache
US6209003B1 (en) * 1998-04-15 2001-03-27 Inktomi Corporation Garbage collection in an object cache
US6546381B1 (en) * 1998-11-02 2003-04-08 International Business Machines Corporation Query optimization system and method
US6499041B1 (en) 1998-12-29 2002-12-24 International Business Machines Corporation Methods, systems and computer program products for copying between templates having associated field designations
US6327594B1 (en) * 1999-01-29 2001-12-04 International Business Machines Corporation Methods for shared data management in a pervasive computing environment
US6282544B1 (en) * 1999-05-24 2001-08-28 Computer Associates Think, Inc. Method and apparatus for populating multiple data marts in a single aggregation process
US6434568B1 (en) * 1999-08-31 2002-08-13 Accenture Llp Information services patterns in a netcentric environment
US6529909B1 (en) * 1999-08-31 2003-03-04 Accenture Llp Method for translating an object attribute converter in an information services patterns environment
US6484177B1 (en) * 2000-01-13 2002-11-19 International Business Machines Corporation Data management interoperability methods for heterogeneous directory structures

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6067552A (en) * 1995-08-21 2000-05-23 Cnet, Inc. User interface system and method for browsing a hypertext database

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
KOTSAKIS EVANGELOS ET AL.: "XML schema directory: A data structure for XML data processing", PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS ENGINEERING, vol. 1, 19 June 2000 (2000-06-19) - 21 June 2000 (2000-06-21), pages 62 - 69, XP002941046 *

Also Published As

Publication number Publication date
JP2004507840A (en) 2004-03-11
EP1330735A1 (en) 2003-07-30
AU2001251451A1 (en) 2002-03-13
CA2420788A1 (en) 2002-03-07
US6694338B1 (en) 2004-02-17

Similar Documents

Publication Publication Date Title
US6694338B1 (en) Virtual aggregate fields
US6169992B1 (en) Search engine for remote access to database management systems
US6523042B2 (en) System and method for translating to and from hierarchical information systems
US7386609B2 (en) Method, system, and program for managing devices in a network
JP5431513B2 (en) Interpreting command scripts using local and extended storage for command indexing
JP5710851B2 (en) System and method for impact analysis
US6931409B2 (en) Method, apparatus, and program to efficiently serialize objects
US9805009B2 (en) Method and device for cascading style sheet (CSS) selector matching
US20090254881A1 (en) Code generation techniques for administrative tasks
US7676797B2 (en) System and method for managing long names in an application programming interface
US20030140045A1 (en) Providing a server-side scripting language and programming tool
WO2001097078A2 (en) Method and apparatus for automatically selecting a rule
JP5113764B2 (en) Transfer and display hierarchical data between databases and electronic documents
US20170206188A1 (en) Manipulation of an ajax tree presentation
JP2004501448A (en) Method and apparatus for automatically converting the format of an electronic message
JPH10232813A (en) System, method, device and product for identification base caching
US20090012993A1 (en) Using viewtypes for accessing instance data structured by a base model
US20060095513A1 (en) Hypermedia management system
EP1328873A1 (en) Virtual groups
US7769750B2 (en) Metadata based hypermedia management system
CA2420817A1 (en) Virtual fields
WO2002019165A1 (en) Virtual fields
US20050273716A1 (en) Database server paging method
US8813101B2 (en) Software technique to correlate conceptually similar entities
Gill Building Web Applications with C# and. NET: A Complete Reference

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
WWE Wipo information: entry into national phase

Ref document number: 2420788

Country of ref document: CA

WWE Wipo information: entry into national phase

Ref document number: 2002523205

Country of ref document: JP

WWE Wipo information: entry into national phase

Ref document number: 2001924833

Country of ref document: EP

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

WWP Wipo information: published in national office

Ref document number: 2001924833

Country of ref document: EP

WWW Wipo information: withdrawn in national office

Ref document number: 2001924833

Country of ref document: EP