WO2003073319A2 - Interactive generation of graphical visualizations of large data structures - Google Patents

Interactive generation of graphical visualizations of large data structures Download PDF

Info

Publication number
WO2003073319A2
WO2003073319A2 PCT/EP2003/002070 EP0302070W WO03073319A2 WO 2003073319 A2 WO2003073319 A2 WO 2003073319A2 EP 0302070 W EP0302070 W EP 0302070W WO 03073319 A2 WO03073319 A2 WO 03073319A2
Authority
WO
WIPO (PCT)
Prior art keywords
records
visualization
cluster
parameters
record
Prior art date
Application number
PCT/EP2003/002070
Other languages
French (fr)
Other versions
WO2003073319A8 (en
Inventor
Thomas Baudel
Original Assignee
Ilog
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ilog filed Critical Ilog
Priority to AU2003210385A priority Critical patent/AU2003210385A1/en
Publication of WO2003073319A2 publication Critical patent/WO2003073319A2/en
Publication of WO2003073319A8 publication Critical patent/WO2003073319A8/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T11/002D [Two Dimensional] image generation
    • G06T11/20Drawing from basic elements, e.g. lines or circles
    • G06T11/206Drawing of charts or graphs
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/34Browsing; Visualisation therefor
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99931Database or file accessing
    • Y10S707/99933Query processing, i.e. searching
    • Y10S707/99936Pattern matching access

Definitions

  • the invention is directed to a method and system for visualization of information stored in a database, such as log files, customer databases, statistical data sets, and more particularly to graphically rendering such information with a high degree of flexibility to extract associations between database records.
  • the methods and computer-embodied programs described herein for graphically rendering, through visualization, data sets, in particular data contained in a table format include a visualization-generation process having as input a style-sheet (representing the visualizations' description) with a set of parameters and a data table.
  • the style-sheet will be referred to hereinafter also as a list of ETALV (Expression of Table Attributes And Local Variables).
  • the process is designed to render those visualizations that can be rendered in a time that is a linear function of the number of records.
  • This group of visualizations will be referred to hereinafter as "Compact Visualizations".
  • Compact Visualizations can be rendered with a small number of parameters which greatly enhances the flexibility of the described visualization process.
  • visualization of records arranged in a data table having columns and rows involves using a graphic user interface (GUI) to interactively set visualization parameters (ETALV) for the records of the data table.
  • GUI graphic user interface
  • ETALV visualization parameters
  • Those records of the data table that have a common ETALV value can then be clustered, and the records and the clusters of records are then sorted in a predetermined order which can also be set by the ETALV s.
  • a geometric attribute primitive is associated with the record or the cluster of records
  • a graphic attribute primitive is associated with an associated geometric attribute primitive.
  • the records or the clusters of records having the associated geometric and graphic attribute primitives are then graphically rendered.
  • Embodiments may include one or more of the following features.
  • Each ETALV operates on a single record of the database.
  • the records or the clusters of records can be graphically rendered in a time that is substantially a linear function of the number of records in the data table.
  • the geometric attribute primitives can include a position, a size, and a shape pattern of an element of a visualization, wherein the shape pattern can be rectangles, polygons, ellipses, and text, as well as other suitable custom shapes.
  • the graphic attribute primitives can be color, pattern, font, and/or line width, by may also include shading, illumination effects, and the like.
  • the visualization parameter structure can be organized as a hierarchical structure that includes sub-visualizations capable of rendering selected clusters.
  • Specific embodiments may include Gantt charts, histograms and 2D- and scatter plots, tree structures, regular tables, as well as combinations, such as a series of histograms laid out as a Gantt chart or presented in a table, a 2D plot of tree nodes, and the like.
  • the methods and programs described herein are also directed to computer programs with computer-executable code to provide a versatile and flexible user interface for generating visualizations in the manner described above.
  • FIG. 1 is a schematic flow diagram of the visualization-rendering process of the invention
  • Fig. 2 is a more detailed description of the algorithm used to render visualizations, presenting Fig. 1 as an algorithm rather than a flow diagram
  • Fig. 3 is a screen shot showing an exemplary organization of a display window
  • Fig. 4 is an exemplary representation of a population map
  • Fig. 5 shows the population map of Fig. 4 with embedded bar graphs
  • Fig. 5 A shows details of the bar graph of Fig. 5
  • Fig. 6 is an exemplary visualization rendered using the invention, depicting a Bar
  • Fig. 7 shows another visualization obtained by sub-clustering the data of Fig. 6.
  • a method and computer-executable program is described for graphically rendering information of a database which can be arranged in form of a table, and for extraction associations between database records.
  • An element and/or an arithmetic expression computed from the records in the database can be associated with a graphic representation consisting of instructions, such as setting a fill color, drawing a rectangle, and so on.
  • the proposed visualization-rendering method is computationally fast, since at most O(N) graphic instructions need to be generated for a database of N records.
  • Data Table - a data set organized in a matrix of rows and columns.
  • the columns are referred to as attributes of the data set.
  • Each column can have a header, a type, and additional properties, such as the format in which the data is stored and whether the column is to be cumulated or averaged when a set of records is aggregated in a group.
  • An instance of the data table has an arbitrary number of lines, often called records, or data objects. For each column, each line has an attribute value that describes the value of the given line for the given column.
  • a data table can also be a table in a relational database.
  • Graphic Language - A graphic language is a set of programming language functions and data types (i.e. a library) that enable describing images on a computer screen, printer, or other computer-driven image-producing device.
  • Modern graphic languages such as OpenGL, Postscript, Microsoft® GDI, Java2D and Java3D consist of two types of functions, geometric attribute primitives and graphic attribute primitives:
  • Geometric attribute primitives describe shapes to be rendered on the screen using Graphic Languages. Examples of geometric attribute primitives are: drawRectangle (x, y, w, h) , drawlcon (bitmap, x, y) , dra Label ( "hello” , x, y) .
  • . __ primitives also have positional parameters, for example, drawRectangle ( 0 , 0 , 120 , 120 ) which will draw a rectangle at the origin of the display (0,0) spanning 120 pixels in both width and height.
  • Graphic attribute primitives describe graphic attributes, such as color, font, and pattern, that are used in drawing the associated geometric attribute primitives. For example, setColor ( "red” ) followed by drawRectangle ( 0 , 0 , 120 , 120 ) will result in a red rectangle being drawn at the coordinates (0, 0, 120, 120).
  • setColor ( . . . ) can have more than one, for example three parameters that describe the RGB color space. It should be noted that there is at most one graphic attribute primitive of each sort being issued between two geometric attribute primitives.
  • Visualization - of a data table is a program, that, given as input any instance of the data table, outputs a uniquely defined sequence of graphic language instructions.
  • Representation - a representation of a data table is the output of a visualization given a data table instance as entry. It consists of a sequence of graphic language instructions only.
  • Expression of table attributes - is a sequence of programming language instructions that consists only of: constants, programming language operators (including functions), and data table attribute names. The attribute names stand for variables that, for each row of the column, will take the value of the current record for the current attribute.
  • Local variable - a variable of the programming language that is reevaluated once for each record in the data table instance and stored as each record is examined - sequentially.
  • Local variables store some state information that can be used, for example, to draw records next to or on top of each other.
  • Expressions of Table Attributes And Local Variables (hereafter referred to as "ETALV") are the parameters of the visualization description method and include the aforementioned expression of table attributes and optionally local variables.
  • the time for computing an ETALV is independent of the number of records held in the table, because the ETALVs do not include instructions that operate on more than a single record at a time.
  • table attributes and local variables need be computed only once for each record and can be cached in an array the size of the data table instance.
  • Compact visualization- Compact Visualizations are defined as those visualizations in which the graphic instruction sequence corresponding to any given data record or group of records is completely defined by a sequence of
  • a compact visualization can be rendered in time directly proportional of the number of records held in a table instance, plus a sorting phase, which according to generally accepted criteria has a complexity of O(n log n) in time, with some other methods purportedly sorting at O(n).
  • a Compact Visualization includes three distinct steps: (1) CLUSTER & SORT; (2) SHAPE SELECTION & LAYOUT; and (3) DECORATION.
  • the invention is directed to a method and a computer-implemented program for describing and then rendering compact visualizations. Those provide a graphic layout of
  • Fig. 1 the method for compact visualizations is depicted in a schematic high-level process flow diagram 100. The various steps of the process flow will be described in more detail below in the exemplary pseudocode and with reference to Figs. 2 and 3.
  • step 102 of Fig. 1 a user defines Expressions of Table Attributes And Local Variables (ETALV) to be applied to the compact visualization of records stored in table 103. Examples of ETALVs are described below with reference to Fig. 3.
  • ETALV Expressions of Table Attributes And Local Variables
  • step 104 the records are "clustered" into a hierarchy of clusters, whereby the data are organized in groups that share a common representation defined by the ETALV parameters of step 102. Thereafter, the data objects are sorted, step 106. Sorting is done after clustering, so that sorting is performed on smaller lists, resulting in significant optimization.
  • Shapes Selection & Layout step 108, one or more graphic primitives (such as a rectangle, a line, etc.) and the coordinates of these primitives are associated with the data objects and/or groups of the sorted hierarchy of clusters.
  • the produced geometric graphic primitives are decorated, for example, by specifying a color, shading and the like, and the decorated graphic primitives are displayed, step 112.
  • the parameters used in the various steps are defined in the ETALVs.
  • An instance of the class Visualization is defined by the ETALV parameters that describe a compact visualization. For each of those parameters, the user provides an Expression object that contributes to the definition of the visualization algorithm.
  • the main function of the Visualization class is a function draw (Table , Frame) . It contains further objects that detail how each step is to be performed: class Visualization ⁇
  • the Visualization-rendering algorithm can be expressed in pseudocode as follows:
  • t represents a set of records of the data table
  • f represents a frame of an object (screen, window)
  • i, j are local counters
  • CurViz is the Visualization structure defined by the ETALVs.
  • is used to parameterize each of the Clusterer,- Sorter, Layout and Decoration classes. It is the implementation of an ETALV as defined above.
  • the evaluate function can be defined by the user to return any object of the language, and it can perform any type of computation allowed by the program, using the values held in the r (Record)-attributes and v (Visualization)- variables.
  • the present implementation provides a simple and convenient expression parser that allows the user to define expression such as a sum of columns, a constant or any simple numeric expression of the language.
  • a Synthesizer class is added as follows: class Expression ⁇ Object evaluate (Record r, Visualization v) ; Object evaluate (Table t, Visualization v) ⁇ Synthesizer . evaluate (c, this, v) ;
  • the synthesizer class returns a condensed single value reflecting the values held in the sub-table. Because different types of attributes may have to be synthesized differently, a number of predefined synthesizers is provided, and users can add their own. For example, in a computer file system, file sizes are additive attributes, as are, in the exemplary database of cities used below, Populations. This means that the size of a group of files is the sum of their individual sizes, or that the Population of a table of cities is the sum of their populations. Not all attributes are additive. For example, if the attribute is the creation date of a file, then the creation date for a table of size is the minimum date of all creation dates. For a last modification date, it is the maximum value of all modifications date (i.e.
  • a visualization can define local variables. These are defined in the class
  • Variable class Variable ⁇ String name
  • This class implements the definition of Local Variable defined above.
  • the variables are defined by the user, who gives them a name, that will be used in Expression objects to access their current value, an initial expression that is evaluated to give the expression a default value at the start of each visualization-rendering process (step 208 of Table 1), and an incremental expression that is evaluated once for each record or group of records that the visualization process handles (after step 224 of Table 1).
  • the records of the data table instance that share a common representation and common local variables are "clustered" or grouped/aggregated based on a set of local variables and graphic primitives that are to be shared among one or several groups of records of the table.
  • the clustering process represents a partitioning operation, that is, a given record belongs to one and only one cluster.
  • a class Clusterer is defined to perform the operational step 204 of clustering records into groups of records:
  • Table 2 represents the pseudocode of the clustering process, which involves splitting a table into a group of smaller tables or individual records that each share a common property.
  • the process takes as input a table t, a Visualization v (to enable accessing the local variables it defines) and a clustering expression, here implemented as the cluster subroutine.
  • the clustering process 300 starts by erasing any cluster that could have been stored in the table of sub-clusters, line 302. Then, for each record held in the table t, the cluster function is called to evaluate in which class the record should belong, line 306. In line 308, the table of sub-tables held in v is examined if there is a sub-table existing for this identifier.
  • each cluster can be seen as a record whose values are synthesized from the records it holds.
  • DateClusterer a date attribute
  • each cluster can be seen as a record whose values are synthesized from the records it holds.
  • the starting date of a cluster of records is the minimum of the starting dates of its records, a numerical quantity can be synthesized by the average (arithmetic, geometric, or harmonic) of its values.
  • Default classes are provided for commonly used ways to aggregate a set of records into one single cluster to be viewed as a record. The number of records that can be aggregated is variable.
  • StringClusterer For example, a function StringClusterer would be defined as follow: class StringClusterer extends Clusterer ⁇ Expression stringExpression; ⁇ ;
  • dateExpression is an ETALV that returns the year held in r when v is empty, the month held in r when the list of subexpressions is a list of months, and so forth for day, hours, minutes and seconds. */ return dateExpression. evaluate (r, v) ; ⁇
  • a special Clusterer type could be implemented to cluster records together not according to some endogenous property, but using exogenous variables, wherein the placement of the sub-clusters is determined heuristically. This could be used, for example, to implement features such as 'squarified treemaps,' which does not represent a true compact visualization. Valid representations may still be produced, however with a reduced efficiency.
  • the clusters and records are sorted in a desired order, step 206, for example, according to the value of attributes in a data table.
  • the following exemplary program sorts a table according to an attribute of the data table.
  • Quicksort is a sorting algorithm, for example, the algorithm defined by Hoare in 1962 and available in most programming languages: Java, C++, etc.. Quicksort uses the function compare as a comparison function and is not linear in time, but sufficiently efficient to be useful with compact visualizations. Alternatively, a "radix" sort technique known in the art could be employed to further optimize the process and provide a linear rendering time.
  • the Shape Selection & Layout process assigns one or several geometric attribute primitives, such as shapes, to a record or cluster of records, steps 214 and 216.
  • shapes can include, for example, rectangles, text labels, lines, segments of arcs, and other geometric attribute primitives, such as disk portion primitives suitable for circular and pie charts as well as line primitives, to enable regular line charts as well as construction of objects by straight lines, such as rectangles, polygons, and Bezier curves.
  • a DECORATION process For each assigned primitive, a DECORATION process, step 218, sets certain graphic attributes for each record being drawn. Such attributes can be color (expressed in RGB, CYM, etc.), patterns, transparency, light coordinates used in certain illumination models described by 3D graphic languages, such as OpenGL and Java3D, line bevel properties, line thickness, and so on. Additional parameters can specify margins, borders, scale, ruler information, and the position of text labels relative to the center of the geometric attribute primitives.
  • the decoration phase is shown as being implemented after the layout phase, since each primitive shape generated by LAYOUT process is determined before being decorated. Thus, decoration is specific for the layout and shape selection. However, those skilled in the art will appreciate that the order in which the steps are performed need not be the same as the order of the logical dependencies between process steps. 5
  • the following program defines a Color Decoration class:
  • ColorDecoration extends Decoration ⁇ Expression redComponent; Expression greenComponent; 10 Expression blueComponent ; function dra (Record r, Visualization v) ⁇ setColor (redComponent .evaluate (r,v) , redComponent . evaluate (r,v) , redComponent . evaluate (r,v) ) ; 15 ⁇
  • a composite decoration enables condensing multiple decoration attributes:
  • PatternDecoration pattern function draw (Record r, Visualization v) ⁇ color.draw (r, v) ; 25 pattern. draw (r, v) ; > , ⁇
  • decorations such as transparency, light coordinates, line bevel properties, line thickness.
  • the Layout step 220 includes assigning parameters to each of the graphic primitives representing a record or a cluster.
  • a parameter can be unspecified, in which case this dimension will be maximized, or it can be an expression of table attributes or local variables.
  • a layout class for rectangles can be defined as: class RectangleLayout ⁇ Expression x; Expression y;
  • Text labels have a further "label" parameter.
  • the most obvious assignment of this parameter is to assign it an attribute name, such as the city name in our sample table data.
  • class TextLayout extends RectangleLayout ⁇ Expression label; function draw (Record r, Visualization v) ⁇ super .draw (r,v) ; setColor (0, 0, 0) ; // black. // Additional customization can be provided to allow the user // to choose a text color.
  • drawLabel (label . evaluate (r,v) , x. evaluate (r,v) + 2, y. evaluate (r,v) + 2) ; ⁇ ;
  • a layout with several geometric attribute primitives can easily be derived from the above classes.
  • a list of layouts and decorations, one for each attribute of the data table, is maintained for each record that is-to-be drawn, step 216.
  • local variables are evaluated once for each record or cluster of records to be rendered. These user-defined local variables often define a current position that is initialized to "0" and incremented by a height (to arrange objects in rows) or a width (to arrange objects in columns) of each record or cluster.
  • the visualization-rendering process is recursive, as described above with reference to steps 212 - 222, i.e., a cluster can be further partitioned into sub-clusters that group together records sharing some common display properties.
  • the visualization can describe different representations at each level in the hierarchy: a visualization data structure is a hierarchy that can describe sub- visualizations that will be used to render selected clusters.
  • step 214 all levels of sub-clusters use the same visualization.
  • Each city is referenced by city name, state, population, longitude, latitude, a case and study numbers (an identifier referring to the origin of the data) as well as the following numerical attributes expressed in relative numbers: climate, housing cost, quality of health care, crime rate, transportation, quality of education, arts, recreation and strength of the economy.
  • GUI 30 shows several windows 31, 32, 33, 34, and 35 allowing a user to manipulate data of the database table and display them using various visualizations.
  • Window 31 is referred to as "Model Inspector”, which displays at least one row of the columns in the data table to be visualized.
  • Window 32 is referred to as “Projection Inspector”, which displays the various parameters that can be set to-create a visualization, as well as their current settings.
  • Window 33 is referred to as "Data Table View”, which displays the representation of the data table in window 31, according to the visualization parameters specified in window 32.
  • the window 32 includes the fields "clustering”, “sorting”, “shape selection” (frame), “layout” and “color” and “fill” (decoration) that are the necessary steps defining a compact visualization, as described above.
  • the user can select the table attributes and local variables (ETALV) by changing the settings of these parameters in window 32, thereby allowing in a simple manner a change in the displayed representations.
  • ETALV table attributes and local variables
  • Window 34 is referred to as "Shelf. In this window displays in form of icons other visualizations of the same data table. These other visualizations can be recalled by clicking on the corresponding icon. Also provided are menu bars 35 that allow manipulation of the various windows, loading and storing of visualizations, data tables, cut/copy/paste operations and so forth. Many of these features are known from standard GUI interfaces.
  • a parameter in window 32 is selected and assigned a column name of the data table, or an arithmetic expression computed from arithmetic values in some of the columns.
  • the values in the "Population” column can be associated with the "Color” attribute in window 32, and the items in the table can be sorted by "Population”. This assignment of values would fill window 33 with horizontally extending rows of rainbow colors that are stacked in the vertical direction, thereby filling the window 33.
  • FIG. 4 Another exemplary display 40 in form of a scatter- plot using -the same database as before is depicted in Fig. 4.
  • the X coordinate 42 is assigned to the "Longitude” table attribute of the cities and the Y coordinate 44 to the "Latitude” attribute. This produces a 2D scatter plot of the cities on a map of the United States.
  • Both the "Size” attribute 45 and the “Color” attribute 46 are set to “Population”.
  • the "Fill” attribute 47 is set to "Plain”. Representations of this type are known in the art. However, as described above with reference to the data displayed in window 31 of Fig.
  • each city or area (and for the same reason each attribute in the table) has associated therewith other values, such as climate, housing cost, quality of health care, crime rate, transportation, etc., which can also be efficiently rendered graphically.
  • other values such as climate, housing cost, quality of health care, crime rate, transportation, etc.
  • display 50 displays a histogram 510 in form of a bar graph for each city or region, wherein the total width of the bar graph represents the population and the height of the individual bars represents other table attributes.
  • the histograms were generated by the application program simply by changing the "Fill" attribute 57 from “Plain” in Fig. 4 to "Bar chart” in Fig. 5.
  • FIG. 5 A A detailed view of the histogram 510 is presented in Fig. 5 A.
  • the bars 502, 503, ... , 508 may represent the values of the attributes selected from window 31. It may be possible for a user to click on the histogram to enlarge the scale and/or display the histogram for that city in a separate window (not shown).
  • the display of the histogram is controlled by commands entered, for example, by mouse clicks, in window 32.
  • Displaying the additionally selected attributes as bar graphs advantageously produces a simultaneous visualization of the database attributes for all cites on the map. This can give an easily recognizable indication as to which cities have better living conditions: those that are close to a full rectangle have better indices for all the above mentioned categories, while those that show big indentations have one or more weak areas.
  • graphic attributes can be recursively assigned to sub-components of an object. Not all visualization attributes need to be assigned to a column of the data model. Instead, they can be associated with an expression of the programming language, which can place them next to each other.
  • X 62 is assigned a "Packing" constant, while Y 64 is set to "0" and Height to "1".
  • the "packing" constant in the user interface sets the width to a local variable of value [Population of the Record]/[Total Population for the Cluster] and defines one local variable, curX, that is initialized to 0 and is incremented by width for each record. This has the effect of placing the cities next to each other, ordered by decreasing population.
  • Vertically stacked are the histograms for each record, i.e., climate, housing, health care quality, etc., creating a vertical bar for each attribute in the data table.
  • the displayed width 68 of each bar of the histogram is proportional to the population.
  • each bar population x attribute index
  • the "educationion" index 67 (height of the bar), shown in the fourth row of Fig. 6, for Philadelphia is greater that of New- York and LA.
  • the area under the respective bars for New- York and LA is larger than the area for Philadelphia. So, even though Philadelphia has a larger proportion of students and teachers in its population, NY and LA have a larger absolute student population.
  • Fig. 7 by selecting a suitable "Clustering" parameter 76, records in display 70 can be grouped and each group rendered using another sub-visualization. For example, Fig. 7 groups (clusters/aggregates) cities belonging to the same State, creating a matrix where each square represents the aggregated value for one State. Different parameters for each group of cities can be specified inside a squarerFor-example, in the small 2D scatter plots 78,
  • the "Housing Cost” index is plotted against the "Climate” index for each State with more than 4 cities in the sample.
  • This visualization shows some States (California (CA) 78 and Washington State (WA) 79) exhibit a strong correlation between Housing cost and climate, whereas no such correlation is evident for other States, such as Texas (TX) 77.
  • Different visualizations can be displayed simultaneously, for example side- by-side, in different windows by easy viewing.
  • the limited number of parameters in window 32 allows flexibility in moving from one visualization to another.
  • the visualizations are "compact” and can be rendered in a time that depends linearly on the number of records in the database.
  • a novel recursive model of visualizations within visualizations can be built by "clustering" attributes in the table database. Each cluster can have its own parameter set specifying, for example, layout, decoration, and sorting settings. Therefore, when changing a parameter, the user can specify whether they want the parameter change to be applied to
  • visualizations can be created to allow particular clusters to represent their content differently.
  • the disclosed method provides access to all visualizations that can be displayed in linear time, i.e., as a linear function of the number of records in the database.
  • the disclosed method is not limited by the size of the data structure " and could be extended to use data structures other than " a data table.
  • the data schema could, for example, be a regular XML DTD.
  • Each type of object to be presented would have to be placed, for example, in a predefined cluster.
  • Layout constraints would become much more complex, as the possible visualizations would no longer have a canonical representation.
  • This class of visualizations would be more difficult to customize, and the efficiency of the described visualization method may not be preserved. Accordingly, the spirit and scope of the present invention is to be limited only by the following claims.

Abstract

A method and computer-executable program is disclosed for graphically rendering information of a database which can be arranged in form of a table, and for extraction associations between database records. An element and/or an arithmetic expression computed from the records in the database can be associated with a graphic representation consisting of instructions, such as setting a fill color, drawing a rectangle, and so on. Graphic renditions can include tables, histograms, bar charts, timeline displays, pie charts, and tree maps, as well as combinations of such displays. The proposed visualization-rendering method is flexible and computationally fast, since at most ON graphic instructions need to be generated for a database of N records.

Description

INTERACTIVE GENERATION OF GRAPHICAL VISUALIZATIONS OF LARGE
DATA STRUCTURES
Field of the Invention The invention is directed to a method and system for visualization of information stored in a database, such as log files, customer databases, statistical data sets, and more particularly to graphically rendering such information with a high degree of flexibility to extract associations between database records.
Background of the Invention
The recent explosive growth of information storage and retrieval creates challenges for extracting meaningful information from large datasets of often seemingly unrelated information. Early methods of representing the information include scatter plots, bar graphs, pie charts, colored maps, tree structures and the like. For example, Microsoft® Excel™ enables viewing a data set as a table or as a chart, Spotfire and eBiz as interactively adjustable 2D and 3D scatter plots or in a parallel coordinate system; Inxight Eureka displays them in a table where numbers are replaced by small bars of length proportional to the value held in the cell. These products, as well as known research prototypes, such as Treemaps, The Influence Explorer or the Prosection Matrix are directed to a limited number of visualization types that are mapped from a data table to a set of graphic attributes. While those products allow customization, such as choosing the scale factors on a scatter plot, choosing the ordering of the columns, or choosing which values of the data are to be mapped onto the axes, none allows interactive browsing of the available space of the visualizations by adjusting a limited number of parameters.
-1- It would therefore be desirable to provide an interactive editor that produces a class of visualizations that can be rendered in real time and thereby remain efficient even for large datasets.
It would further be desirable to provide a simple, but highly flexible user interface for accessing the available visualization created in this way.
Summary of the Invention
The methods and computer-embodied programs described herein for graphically rendering, through visualization, data sets, in particular data contained in a table format, include a visualization-generation process having as input a style-sheet (representing the visualizations' description) with a set of parameters and a data table. The style-sheet will be referred to hereinafter also as a list of ETALV (Expression of Table Attributes And Local Variables). The process is designed to render those visualizations that can be rendered in a time that is a linear function of the number of records. This group of visualizations will be referred to hereinafter as "Compact Visualizations". Compact Visualizations can be rendered with a small number of parameters which greatly enhances the flexibility of the described visualization process.
According to one aspect of the invention, visualization of records arranged in a data table having columns and rows involves using a graphic user interface (GUI) to interactively set visualization parameters (ETALV) for the records of the data table. Those records of the data table that have a common ETALV value can then be clustered, and the records and the clusters of records are then sorted in a predetermined order which can also be set by the ETALV s. Based on the selected ETALV s for the record or the cluster of records, a geometric attribute primitive is associated with the record or the cluster of records, and a graphic attribute primitive is associated with an associated geometric attribute primitive. The records or the clusters of records having the associated geometric and graphic attribute primitives are then graphically rendered.
Embodiments may include one or more of the following features. Each ETALV operates on a single record of the database. In this way, the records or the clusters of records can be graphically rendered in a time that is substantially a linear function of the number of records in the data table. The geometric attribute primitives can include a position, a size, and a shape pattern of an element of a visualization, wherein the shape pattern can be rectangles, polygons, ellipses, and text, as well as other suitable custom shapes. The graphic attribute primitives can be color, pattern, font, and/or line width, by may also include shading, illumination effects, and the like. The visualization parameter structure can be organized as a hierarchical structure that includes sub-visualizations capable of rendering selected clusters. Specific embodiments may include Gantt charts, histograms and 2D- and scatter plots, tree structures, regular tables, as well as combinations, such as a series of histograms laid out as a Gantt chart or presented in a table, a 2D plot of tree nodes, and the like. The methods and programs described herein are also directed to computer programs with computer-executable code to provide a versatile and flexible user interface for generating visualizations in the manner described above.
Further features and advantages of the present invention will be apparent from the following description of preferred embodiments and from the claims.
Brief Description of the Drawings
The following figures depict certain illustrative embodiments of the invention in which like reference numerals refer to like elements. These depicted embodiments are to be understood as illustrative of the invention and not as limiting in any way. Fig. 1 is a schematic flow diagram of the visualization-rendering process of the invention; Fig. 2 is a more detailed description of the algorithm used to render visualizations, presenting Fig. 1 as an algorithm rather than a flow diagram;
Fig. 3 is a screen shot showing an exemplary organization of a display window; Fig. 4 is an exemplary representation of a population map; Fig. 5 shows the population map of Fig. 4 with embedded bar graphs; Fig. 5 A shows details of the bar graph of Fig. 5; Fig. 6 is an exemplary visualization rendered using the invention, depicting a Bar
Chart; and Fig. 7 shows another visualization obtained by sub-clustering the data of Fig. 6. Detailed Description of Certain Embodiments
A method and computer-executable program is described for graphically rendering information of a database which can be arranged in form of a table, and for extraction associations between database records. An element and/or an arithmetic expression computed from the records in the database can be associated with a graphic representation consisting of instructions, such as setting a fill color, drawing a rectangle, and so on. The proposed visualization-rendering method is computationally fast, since at most O(N) graphic instructions need to be generated for a database of N records.
The following exemplary definitions refer to terminology used throughout the specification: Data Table - a data set organized in a matrix of rows and columns. The columns are referred to as attributes of the data set. Each column can have a header, a type, and additional properties, such as the format in which the data is stored and whether the column is to be cumulated or averaged when a set of records is aggregated in a group. An instance of the data table has an arbitrary number of lines, often called records, or data objects. For each column, each line has an attribute value that describes the value of the given line for the given column. A data table can also be a table in a relational database.
Graphic Language - A graphic language is a set of programming language functions and data types (i.e. a library) that enable describing images on a computer screen, printer, or other computer-driven image-producing device. Modern graphic languages, such as OpenGL, Postscript, Microsoft® GDI, Java2D and Java3D consist of two types of functions, geometric attribute primitives and graphic attribute primitives: Geometric attribute primitives - describe shapes to be rendered on the screen using Graphic Languages. Examples of geometric attribute primitives are: drawRectangle (x, y, w, h) , drawlcon (bitmap, x, y) , dra Label ( "hello" , x, y) . Geometric attribute
. __ primitives also have positional parameters,, for example, drawRectangle ( 0 , 0 , 120 , 120 ) which will draw a rectangle at the origin of the display (0,0) spanning 120 pixels in both width and height. Graphic attribute primitives - describe graphic attributes, such as color, font, and pattern, that are used in drawing the associated geometric attribute primitives. For example, setColor ( "red" ) followed by drawRectangle ( 0 , 0 , 120 , 120 ) will result in a red rectangle being drawn at the coordinates (0, 0, 120, 120). setColor ( . . . ) can have more than one, for example three parameters that describe the RGB color space. It should be noted that there is at most one graphic attribute primitive of each sort being issued between two geometric attribute primitives.
Visualization - of a data table is a program, that, given as input any instance of the data table, outputs a uniquely defined sequence of graphic language instructions. Representation - a representation of a data table is the output of a visualization given a data table instance as entry. It consists of a sequence of graphic language instructions only. Expression of table attributes - is a sequence of programming language instructions that consists only of: constants, programming language operators (including functions), and data table attribute names. The attribute names stand for variables that, for each row of the column, will take the value of the current record for the current attribute. Local variable - a variable of the programming language that is reevaluated once for each record in the data table instance and stored as each record is examined - sequentially. Local variables store some state information that can be used, for example, to draw records next to or on top of each other. Expressions of Table Attributes And Local Variables (hereafter referred to as "ETALV") are the parameters of the visualization description method and include the aforementioned expression of table attributes and optionally local variables.
It should be noted that the time for computing an ETALV is independent of the number of records held in the table, because the ETALVs do not include instructions that operate on more than a single record at a time. Furthermore, table attributes and local variables need be computed only once for each record and can be cached in an array the size of the data table instance.
Compact visualization- Compact Visualizations are defined as those visualizations in which the graphic instruction sequence corresponding to any given data record or group of records is completely defined by a sequence of
ETALVs. Those ETALVs are the parameters of the visualization- generating algorithm described below. A compact visualization can be rendered in time directly proportional of the number of records held in a table instance, plus a sorting phase, which according to generally accepted criteria has a complexity of O(n log n) in time, with some other methods purportedly sorting at O(n). A Compact Visualization includes three distinct steps: (1) CLUSTER & SORT; (2) SHAPE SELECTION & LAYOUT; and (3) DECORATION.
In general, the invention is directed to a method and a computer-implemented program for describing and then rendering compact visualizations. Those provide a graphic layout of
— information contained in large databases that- -allows - a user to visually establish correspondences, find useful patterns, highlight trends in the datasets, and the like. The proposed method is efficient and can be customized depending on the application. Referring now to Fig. 1, the method for compact visualizations is depicted in a schematic high-level process flow diagram 100. The various steps of the process flow will be described in more detail below in the exemplary pseudocode and with reference to Figs. 2 and 3. In step 102 of Fig. 1, a user defines Expressions of Table Attributes And Local Variables (ETALV) to be applied to the compact visualization of records stored in table 103. Examples of ETALVs are described below with reference to Fig. 3. In step 104, the records are "clustered" into a hierarchy of clusters, whereby the data are organized in groups that share a common representation defined by the ETALV parameters of step 102. Thereafter, the data objects are sorted, step 106. Sorting is done after clustering, so that sorting is performed on smaller lists, resulting in significant optimization. In Shapes Selection & Layout, step 108, one or more graphic primitives (such as a rectangle, a line, etc.) and the coordinates of these primitives are associated with the data objects and/or groups of the sorted hierarchy of clusters. In a final step 110, the produced geometric graphic primitives are decorated, for example, by specifying a color, shading and the like, and the decorated graphic primitives are displayed, step 112. As mentioned above, the parameters used in the various steps are defined in the ETALVs. An instance of the class Visualization is defined by the ETALV parameters that describe a compact visualization. For each of those parameters, the user provides an Expression object that contributes to the definition of the visualization algorithm. The main function of the Visualization class is a function draw (Table , Frame) . It contains further objects that detail how each step is to be performed: class Visualization {
Identifier id;
List of..Variable __variables; .
Clusterer _clusterer;
Sorter _sorter; Layout _layout;
Decoration _decoration;
List of Visualization _subVisualizations ;
List of pair<Identifier, Table> _clusters; function draw(Table, Frame); };
The Visualization-rendering algorithm can be expressed in pseudocode as follows:
200 Visualization: : draw (Table t, Frame f) {
204 clusterer . initialize (t, this);
206 sorter. sort ( clusters, this);
208 for each variable v in variables v. initialize (t, this);
210 for each pair<Identifier, Table> i in _clusters { Visualization curViz =
212 subVisualizations . find(i . identifier) ; 214 if (curViz ==null) curViz = this;
216 for each graphic shape g in curViz ._layout 218 curViz ._decoration. draw (i .table, curViz) 220 g. draw (i . table, curViz);
222 curViz .draw (i .table, f) ;
224 for each variable v in _variables v. incrementVariables (i, this);
Table 1
In the above algorithm, t represents a set of records of the data table; f represents a frame of an object (screen, window); i, j are local counters; and CurViz is the Visualization structure defined by the ETALVs. A visualization-rendering process assumes that the screen has been cleared before starting, and starts by setting the counter i=0. Before describing the
Clusterer, Sorter, Layout, and Decoration classes in detail, the concept of expressions and local variables is introduced.
The class Expression: class Expression {
Obj ect evaluate (Record r, Visualization v) ;
} is used to parameterize each of the Clusterer,- Sorter, Layout and Decoration classes. It is the implementation of an ETALV as defined above. The evaluate function can be defined by the user to return any object of the language, and it can perform any type of computation allowed by the program, using the values held in the r (Record)-attributes and v (Visualization)- variables. The present implementation provides a simple and convenient expression parser that allows the user to define expression such as a sum of columns, a constant or any simple numeric expression of the language.
In addition, the real Expression class is enhanced with the ability to synthesize a value for a group (a sub-table) of records. Accordingly, a Synthesizer class is added as follows: class Expression { Object evaluate (Record r, Visualization v) ; Object evaluate (Table t, Visualization v) { Synthesizer . evaluate (c, this, v) ;
} Synthesizer synthesizer;
class Synthesizer { virtual Object evaluate (Table t, Expression e, Visualization v) ;
} class SumSynthesizer implements Synthesizer {
Object evaluate (Table t, Expression e, Visualization v) { Integer result=0;
// computes the sum of the values of expression e for the // whole table for each record r in t { result = result + e . evaluate (r, v) ; } return result;
class MinSynthesizer implements Synthesizer { Object evaluate (Table t, Expression e, Visualization v) { Integer result=MAXINT; // computes the minimum value of expression e for the // whole table-- -- ~ for each record r in t { Integer temp=e . evaluate (r, v) ;
If (temp<result) result=temp; return result; }
// and so forth for Max, average, or
// other statistical moments: median, interval...
The synthesizer class returns a condensed single value reflecting the values held in the sub-table. Because different types of attributes may have to be synthesized differently, a number of predefined synthesizers is provided, and users can add their own. For example, in a computer file system, file sizes are additive attributes, as are, in the exemplary database of cities used below, Populations. This means that the size of a group of files is the sum of their individual sizes, or that the Population of a table of cities is the sum of their populations. Not all attributes are additive. For example, if the attribute is the creation date of a file, then the creation date for a table of size is the minimum date of all creation dates. For a last modification date, it is the maximum value of all modifications date (i.e. the latest date of all records held in the table). If no synthesizer is provided by the user, the values of the expression for each record are averaged. Therefore, an expression can be evaluated in the same way, interchangeably either for a single record or for a table (or sub-table) of records. As mentioned before, a visualization can define local variables. These are defined in the class
Variable: class Variable { String name;
Expression initExpression; Expression incrementExpression; Obj ect currentValue ; Object getValue(); void initialize (Table t, Visualization v) { currentValue=initExpression. evaluate (t, v) ; v}oid increment (Record r, Visualization v) { currentValue=incrementExpression. evaluate (r, v) ;
. }
This class implements the definition of Local Variable defined above. The variables are defined by the user, who gives them a name, that will be used in Expression objects to access their current value, an initial expression that is evaluated to give the expression a default value at the start of each visualization-rendering process (step 208 of Table 1), and an incremental expression that is evaluated once for each record or group of records that the visualization process handles (after step 224 of Table 1).
Referring now to Fig. 2, for a better understanding of the visualization-rendering algorithm, the pseudocode commands of Table 1 are illustrated in form of a flowchart 200, with the reference numerals in the flowchart 200 having a one-to-one correspondence to the pseudocode commands. A visualization-rendering process starts by setting the counter i=0, step 202. In step 204, the records of the data table instance that share a common representation and common local variables are "clustered" or grouped/aggregated based on a set of local variables and graphic primitives that are to be shared among one or several groups of records of the table. To preserve the linear rendering time constraints characteristic of compact visualizations, the clustering process represents a partitioning operation, that is, a given record belongs to one and only one cluster.
A class Clusterer is defined to perform the operational step 204 of clustering records into groups of records:
class Clusterer {
Expression tableExpression; function initialize (Table, Visualization); function Identifier cluster (Record r, Visualization v) { return tableExpression. evaluate (r, v) ;
}
} with the function initialize defined as follows:
Figure imgf000014_0001
Table 2 Table 2 represents the pseudocode of the clustering process, which involves splitting a table into a group of smaller tables or individual records that each share a common property. The process takes as input a table t, a Visualization v (to enable accessing the local variables it defines) and a clustering expression, here implemented as the cluster subroutine. The clustering process 300 starts by erasing any cluster that could have been stored in the table of sub-clusters, line 302. Then, for each record held in the table t, the cluster function is called to evaluate in which class the record should belong, line 306. In line 308, the table of sub-tables held in v is examined if there is a sub-table existing for this identifier. If there is none, line 312, a new sub-table associated with identifier curld is added. Then, line 314, the current record is added to the corresponding sub-table. The process 300 is executed for each record held in Table t by returning to line 304. This process results in Table t being divided in a series of sub-tables, each identified by an identifier Id and stored (cached) in the Vi sual i zat ion data structure. The identifier Curld returned by the cluster routine and used to define each record's class is provided by evaluating the user-defined ETALVs. Examples of clustering are grouping data according to the prefixes of a string of characters held in an attribute
(StringClusterer) , or according to a date attribute (DateClusterer) , for example, year, month, day, hour, and minute. As mentioned in the description of the class Expression, each cluster can be seen as a record whose values are synthesized from the records it holds. For each-attribute of the data table, one can specify how the records- are to be aggregated when the records are grouped. Typically, sizes are added; the starting date of a cluster of records is the minimum of the starting dates of its records, a numerical quantity can be synthesized by the average (arithmetic, geometric, or harmonic) of its values. Default classes are provided for commonly used ways to aggregate a set of records into one single cluster to be viewed as a record. The number of records that can be aggregated is variable.
For example, a function StringClusterer would be defined as follow: class StringClusterer extends Clusterer { Expression stringExpression; } ;
Identifier StringClusterer : : cluster (Record r, Visualization v) { // stringExpression simply returns the string held in r. return stringExpression. evaluate (r, v) ; }
class DateClusterer extends Clusterer { Expression dateExpression;
};
Identifier StringClusterer :: cluster (Record r,
Visualization v) {
/* dateExpression is an ETALV that returns the year held in r when v is empty, the month held in r when the list of subexpressions is a list of months, and so forth for day, hours, minutes and seconds. */ return dateExpression. evaluate (r, v) ; }
A special Clusterer type could be implemented to cluster records together not according to some endogenous property, but using exogenous variables, wherein the placement of the sub-clusters is determined heuristically. This could be used, for example, to implement features such as 'squarified treemaps,' which does not represent a true compact visualization. Valid representations may still be produced, however with a reduced efficiency.
After computing the clusters, but before shape selection and layout, the clusters and records are sorted in a desired order, step 206, for example, according to the value of attributes in a data table. The following exemplary program sorts a table according to an attribute of the data table.
class Sorter {
Expression numericExpression; Visualization current_visualization; function int compare (Record a, Record b) ; function sort (Table t, Visualization v) ;
} int compare (Record a, Record b) { return numericExpression. evaluate (a, current visualization) - numericExpression. evaluate (b, current visualization);
};
Sorter :: sort (Table t, Visualization v) { current_visualization=v; quicksort (t, compare) ;
Quicksort is a sorting algorithm, for example, the algorithm defined by Hoare in 1962 and available in most programming languages: Java, C++, etc.. Quicksort uses the function compare as a comparison function and is not linear in time, but sufficiently efficient to be useful with compact visualizations. Alternatively, a "radix" sort technique known in the art could be employed to further optimize the process and provide a linear rendering time.
After initializing each variable v, step 208, and defining the current visualization as expressed by the visualization parameters, step 212, the Shape Selection & Layout process assigns one or several geometric attribute primitives, such as shapes, to a record or cluster of records, steps 214 and 216. These shapes can include, for example, rectangles, text labels, lines, segments of arcs, and other geometric attribute primitives, such as disk portion primitives suitable for circular and pie charts as well as line primitives, to enable regular line charts as well as construction of objects by straight lines, such as rectangles, polygons, and Bezier curves.
For each assigned primitive, a DECORATION process, step 218, sets certain graphic attributes for each record being drawn. Such attributes can be color (expressed in RGB, CYM, etc.), patterns, transparency, light coordinates used in certain illumination models described by 3D graphic languages, such as OpenGL and Java3D, line bevel properties, line thickness, and so on. Additional parameters can specify margins, borders, scale, ruler information, and the position of text labels relative to the center of the geometric attribute primitives. In the described process 200, the decoration phase is shown as being implemented after the layout phase, since each primitive shape generated by LAYOUT process is determined before being decorated. Thus, decoration is specific for the layout and shape selection. However, those skilled in the art will appreciate that the order in which the steps are performed need not be the same as the order of the logical dependencies between process steps. 5 The following program defines a Color Decoration class:
class ColorDecoration extends Decoration { Expression redComponent; Expression greenComponent; 10 Expression blueComponent ; function dra (Record r, Visualization v) { setColor (redComponent .evaluate (r,v) , redComponent . evaluate (r,v) , redComponent . evaluate (r,v) ) ; 15 }
} ;
A composite decoration enables condensing multiple decoration attributes:
-20— class CombinedDecoration extends—Decoration {
ColorDecoration color;
PatternDecoration pattern; function draw (Record r, Visualization v) { color.draw (r, v) ; 25 pattern. draw (r, v) ; > , }
As mentioned above, depending on the power of expression of the graphic language, various
30 other parameters can be used as decorations, such as transparency, light coordinates, line bevel properties, line thickness.
The Layout step 220 includes assigning parameters to each of the graphic primitives representing a record or a cluster. A parameter can be unspecified, in which case this dimension will be maximized, or it can be an expression of table attributes or local variables.
35 Exemplary layout shapes are rectangles and text. A layout class for rectangles can be defined as: class RectangleLayout { Expression x; Expression y;
Expression width;
Expression height; function draw (Record r, Visualization v) { fillRectangle (x. evaluate (r, v) , y. evaluate (r,v) , width. evaluate (r, v) , height . evaluate ( , v) ) ;
} }
Text labels have a further "label" parameter. The most obvious assignment of this parameter is to assign it an attribute name, such as the city name in our sample table data. class TextLayout extends RectangleLayout { Expression label; function draw (Record r, Visualization v) { super .draw (r,v) ; setColor (0, 0, 0) ; // black. // Additional customization can be provided to allow the user // to choose a text color. drawLabel (label . evaluate (r,v) , x. evaluate (r,v) + 2, y. evaluate (r,v) + 2) ; };
It will be understood by those skilled in the art that a layout with several geometric attribute primitives can easily be derived from the above classes. A list of layouts and decorations, one for each attribute of the data table, is maintained for each record that is-to-be drawn, step 216. In step 224, local variables are evaluated once for each record or cluster of records to be rendered. These user-defined local variables often define a current position that is initialized to "0" and incremented by a height (to arrange objects in rows) or a width (to arrange objects in columns) of each record or cluster. The visualization-rendering process is recursive, as described above with reference to steps 212 - 222, i.e., a cluster can be further partitioned into sub-clusters that group together records sharing some common display properties. As seen in step 212, the visualization can describe different representations at each level in the hierarchy: a visualization data structure is a hierarchy that can describe sub- visualizations that will be used to render selected clusters. By default, step 214, all levels of sub-clusters use the same visualization. The invention will now be described with reference to several screenshots illustrated in the drawings. To provide an exemplary database, statistical data from approximately 300 US cities were gathered and arranged in a table format. Each city is referenced by city name, state, population, longitude, latitude, a case and study numbers (an identifier referring to the origin of the data) as well as the following numerical attributes expressed in relative numbers: climate, housing cost, quality of health care, crime rate, transportation, quality of education, arts, recreation and strength of the economy.
Referring first to Fig. 3, after the application program is started, a graphic user interface (GUI) 30 shows several windows 31, 32, 33, 34, and 35 allowing a user to manipulate data of the database table and display them using various visualizations. Window 31 is referred to as "Model Inspector", which displays at least one row of the columns in the data table to be visualized. Window 32 is referred to as "Projection Inspector", which displays the various parameters that can be set to-create a visualization, as well as their current settings. Window 33 is referred to as "Data Table View", which displays the representation of the data table in window 31, according to the visualization parameters specified in window 32. As can be seen from a closer inspection of window 32, the window 32 includes the fields "clustering", "sorting", "shape selection" (frame), "layout" and "color" and "fill" (decoration) that are the necessary steps defining a compact visualization, as described above. The user can select the table attributes and local variables (ETALV) by changing the settings of these parameters in window 32, thereby allowing in a simple manner a change in the displayed representations. As can be seen from the settings in window 32 depicted in Figs. 4-7, all the different displayed representations are rendered by simply changing a few parameter settings in window 32. For usability reasons, the definition of local variables is hidden in the graphical user interface; those are replaced by preset buttons, such as the "alternate X/Y" button, that define often used variables. This is illustrated later, in figure 6. The full features are available in the text-based style sheets generated by the "projection inspector". Window 34 is referred to as "Shelf. In this window displays in form of icons other visualizations of the same data table. These other visualizations can be recalled by clicking on the corresponding icon. Also provided are menu bars 35 that allow manipulation of the various windows, loading and storing of visualizations, data tables, cut/copy/paste operations and so forth. Many of these features are known from standard GUI interfaces. In one embodiment, a parameter in window 32 is selected and assigned a column name of the data table, or an arithmetic expression computed from arithmetic values in some of the columns. In a simple example, the values in the "Population" column can be associated with the "Color" attribute in window 32, and the items in the table can be sorted by "Population". This assignment of values would fill window 33 with horizontally extending rows of rainbow colors that are stacked in the vertical direction, thereby filling the window 33.
Another exemplary display 40 in form of a scatter- plot using -the same database as before is depicted in Fig. 4. In the initial step, the X coordinate 42 is assigned to the "Longitude" table attribute of the cities and the Y coordinate 44 to the "Latitude" attribute. This produces a 2D scatter plot of the cities on a map of the United States. Both the "Size" attribute 45 and the "Color" attribute 46 are set to "Population". The "Fill" attribute 47 is set to "Plain". Representations of this type are known in the art. However, as described above with reference to the data displayed in window 31 of Fig. 3, each city or area (and for the same reason each attribute in the table) has associated therewith other values, such as climate, housing cost, quality of health care, crime rate, transportation, etc., which can also be efficiently rendered graphically. Referring now to Fig. 5, instead of representing each city by a rectangle with a width corresponding to the population, as previously discussed with reference to display 40 of Fig.
4, according to the invention, display 50 displays a histogram 510 in form of a bar graph for each city or region, wherein the total width of the bar graph represents the population and the height of the individual bars represents other table attributes. The histograms were generated by the application program simply by changing the "Fill" attribute 57 from "Plain" in Fig. 4 to "Bar chart" in Fig. 5.
A detailed view of the histogram 510 is presented in Fig. 5 A. The bars 502, 503, ... , 508 may represent the values of the attributes selected from window 31. It may be possible for a user to click on the histogram to enlarge the scale and/or display the histogram for that city in a separate window (not shown). The display of the histogram is controlled by commands entered, for example, by mouse clicks, in window 32.
Displaying the additionally selected attributes as bar graphs advantageously produces a simultaneous visualization of the database attributes for all cites on the map. This can give an easily recognizable indication as to which cities have better living conditions: those that are close to a full rectangle have better indices for all the above mentioned categories, while those that show big indentations have one or more weak areas.
According to the invention, instead of assigning a single geometric attribute primitive to an attribute, graphic attributes can be recursively assigned to sub-components of an object. Not all visualization attributes need to be assigned to a column of the data model. Instead, they can be associated with an expression of the programming language, which can place them next to each other.
This is illustrated in Fig. 6. In display 60, X 62 is assigned a "Packing" constant, while Y 64 is set to "0" and Height to "1". The "packing" constant in the user interface sets the width to a local variable of value [Population of the Record]/[Total Population for the Cluster] and defines one local variable, curX, that is initialized to 0 and is incremented by width for each record. This has the effect of placing the cities next to each other, ordered by decreasing population. Vertically stacked are the histograms for each record, i.e., climate, housing, health care quality, etc., creating a vertical bar for each attribute in the data table. The displayed width 68 of each bar of the histogram is proportional to the population. This means that the area of each bar (population x attribute index) is proportional to the total number for this table attribute. For example, the "Education" index 67 (height of the bar), shown in the fourth row of Fig. 6, for Philadelphia is greater that of New- York and LA. However, the area under the respective bars for New- York and LA is larger than the area for Philadelphia. So, even though Philadelphia has a larger proportion of students and teachers in its population, NY and LA have a larger absolute student population.
Referring now to Fig. 7, by selecting a suitable "Clustering" parameter 76, records in display 70 can be grouped and each group rendered using another sub-visualization. For example, Fig. 7 groups (clusters/aggregates) cities belonging to the same State, creating a matrix where each square represents the aggregated value for one State. Different parameters for each group of cities can be specified inside a squarerFor-example, in the small 2D scatter plots 78,
79 in the top three rows, the "Housing Cost" index is plotted against the "Climate" index for each State with more than 4 cities in the sample. This visualization shows some States (California (CA) 78 and Washington State (WA) 79) exhibit a strong correlation between Housing cost and Climate, whereas no such correlation is evident for other States, such as Texas (TX) 77. Different visualizations can be displayed simultaneously, for example side- by-side, in different windows by easy viewing.
In summary, the limited number of parameters in window 32 allows flexibility in moving from one visualization to another. The visualizations are "compact" and can be rendered in a time that depends linearly on the number of records in the database. In addition, a novel recursive model of visualizations within visualizations can be built by "clustering" attributes in the table database. Each cluster can have its own parameter set specifying, for example, layout, decoration, and sorting settings. Therefore, when changing a parameter, the user can specify whether they want the parameter change to be applied to
• the currently selected cluster and all its sub-clusters
• or the currently selected cluster only
• or the direct immediate children of the selected cluster only
• or the leaf children of the selected cluster only.
Depending on which option has been chosen, visualizations can be created to allow particular clusters to represent their content differently. The disclosed method provides access to all visualizations that can be displayed in linear time, i.e., as a linear function of the number of records in the database.
While the invention has been disclosed in connection with the preferred embodiments shown and described in detail, various modifications and improvements thereon will become readily apparent to those skilled in the art. For example, the disclosed method is not limited by the size of the data structure "and could be extended to use data structures other than "a data table. Instead of using a list of typed attributes, the data schema could, for example, be a regular XML DTD. Each type of object to be presented would have to be placed, for example, in a predefined cluster. Layout constraints would become much more complex, as the possible visualizations would no longer have a canonical representation. This class of visualizations would be more difficult to customize, and the efficiency of the described visualization method may not be preserved. Accordingly, the spirit and scope of the present invention is to be limited only by the following claims.

Claims

Claims
1. Method for graphically representing records arranged in a data table having columns and rows, comprising: defining a set of visualization parameters for the data table; clustering the records of the data table that have a common cluster parameter value in the visualization set of parameters; sorting the clustered records in a predetermined order defined by a sort parameter value in the visualization set of parameters; associating a geometric attribute primitive with the record or the cluster of records, said geometric attribute primitive defined by a shape selection parameter value in the visualization set of parameters for said record or the cluster of records; associating a graphic attribute primitive with an associated geometric attribute primitive of the record or the cluster of records, said graphic attribute primitive defined by a decoration parameter value in the visualization set of parameters for said record or the cluster of records; and graphically representing the sorted records or the clusters of records having the associated geometric and graphic attribute primitives.
2. The method of claim 1, wherein the set of visualization parameters for said records or the cluster of records in the database is arranged in form of a style sheet.
3. The method of claim 1, wherein a visualization parameter of the set operates on a single record or cluster of records of the database.
4. The method of claim 1, wherein the set of visualization parameters comprises parameters selected from the group consisting of a column of the data table, a local variable name, and programming language operators.
5. The method, of claim 1, wherein the records or the clusters of records are graphically rendered in a time that is substantially a linear function of the number of records in the data table.
6. The method of claim 1, wherein the geometric attribute primitives are selected from the group consisting of position, size, and shape.
7. The method of claim 6, wherein the shape is selected from the group consisting of rectangle, polygon, ellipse, line and text.
8. The method of claim 1, wherein the graphic attribute primitives are selected from the group consisting of color, pattern, font, and line width.
9. The method of claim 1, further comprising: selecting clusters from the clustered records; associating sub-visualization parameters with selected clusters; and performing the steps of associating and graphically rendering on the selected clusters.
10. The method of claim 1, wherein graphically rendering includes displaying visualizations selected from the group consisting of Gantt charts, histograms and 2D- and scatter plots, tree structures, and data tables.
11. Computer program comprising computer-executable code for causing a computer to: define a set of visualization parameters for a record of a database, the database arranged in form of a data table having columns and rows; cluster the records of the database that have a common cluster parameter value in the visualization set of parameters; sort the clustered records in a predetermined order defined by a sort parameter value in the visualization set of parameters; associate a geometric attribute primitive with the record or the cluster of records, said geometric attribute primitive defined by a shape selection parameter value in the visualization set of parameters for said record or the cluster of records; associate a graphic attribute primitive with an associated geometric attribute primitive of the record or the cluster of records, said graphic attribute primitive defined by a decoration parameter value in the visualization set of parameters for said record or the cluster of records; and graphically represent the sorted records or the clusters of records having the associated geometric and graphic attribute primitives.
12. The computer program of claim 11, wherein the set of visualization parameters for said records or the cluster of records in the database are arranged in form of a style sheet.
13. The computer program of claim 11, wherein a visualization parameter of the set operates on a single record or cluster of records of the database.
14. The computer program of claim 11, wherein the records or the clusters of records are graphically rendered in a time that is substantially a linear function of the number of records in the data table.
15. The computer. rogram of claim 11, wherein the geometric attribute primitives are selected from the group consisting of position, size, and shape.
16. The computer program of claim 15, wherein the shape is selected from the group consisting of rectangle, polygon, ellipse, line and text.
17. The computer program of claim 11, wherein the graphic attribute primitives are selected from the group consisting of color, pattern, font, and line width.
18. The computer program of claim 11 , further comprising: selecting clusters from the clustered records; associating sub-visualization parameters with selected clusters; and performing the steps of associating and graphically rendering on the selected clusters.
19. The computer program of claim 11 , wherein graphically rendering includes displaying visualizations selected from the group consisting of Gantt charts, histograms and 2D- and scatter plots, tree structures, and data tables.
20. Computer program code embodied in a computer-readable medium, comprising: computer-executable program code for defining a set of visualization parameters for a record of the database arranged in a data table having columns and rows; computer-executable program code for clustering the records of the database that have a common cluster parameter value in the visualization set of parameters; computer-executable program code for sorting the clustered records in a predetermined order defined by a sort parameter value in the visualization set of parameters; computer-executable program code for associating a geometric attribute primitive with the record or the cluster of records, said geometric attribute primitive defined by a shape selection parameter value in the visualization set of parameters for said record or the cluster of records; computer-executable program code for associating a graphic attribute primitive with an associated geometric attribute primitive of the record or the cluster of records, said graphic attribute primitive defined by a decoration parameter value in the visualization set of parameters for said record or the cluster of records; and computer-executable program code for graphically rendering the sorted records or the clusters of records having the associated geometric and graphic attribute primitives.
PCT/EP2003/002070 2002-02-28 2003-02-27 Interactive generation of graphical visualizations of large data structures WO2003073319A2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU2003210385A AU2003210385A1 (en) 2002-02-28 2003-02-27 Interactive generation of graphical visualizations of large data structures

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US10/086,964 US6928436B2 (en) 2002-02-28 2002-02-28 Interactive generation of graphical visualizations of large data structures
US10/086,964 2002-02-28

Publications (2)

Publication Number Publication Date
WO2003073319A2 true WO2003073319A2 (en) 2003-09-04
WO2003073319A8 WO2003073319A8 (en) 2004-03-18

Family

ID=27765355

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2003/002070 WO2003073319A2 (en) 2002-02-28 2003-02-27 Interactive generation of graphical visualizations of large data structures

Country Status (3)

Country Link
US (1) US6928436B2 (en)
AU (1) AU2003210385A1 (en)
WO (1) WO2003073319A2 (en)

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8880487B1 (en) 2011-02-18 2014-11-04 Pegasystems Inc. Systems and methods for distributed rules processing
US8959480B2 (en) 2004-05-26 2015-02-17 Pegasystems Inc. Methods and apparatus for integration of declarative rule-based processing with procedural programming in a digital data-processing environment
US9658735B2 (en) 2006-03-30 2017-05-23 Pegasystems Inc. Methods and apparatus for user interface optimization
US9678719B1 (en) 2009-03-30 2017-06-13 Pegasystems Inc. System and software for creation and modification of software
US10469396B2 (en) 2014-10-10 2019-11-05 Pegasystems, Inc. Event processing with enhanced throughput
US10467200B1 (en) 2009-03-12 2019-11-05 Pegasystems, Inc. Techniques for dynamic data processing
US10572236B2 (en) 2011-12-30 2020-02-25 Pegasystems, Inc. System and method for updating or modifying an application without manual coding
US10698599B2 (en) 2016-06-03 2020-06-30 Pegasystems, Inc. Connecting graphical shapes using gestures
US10698647B2 (en) 2016-07-11 2020-06-30 Pegasystems Inc. Selective sharing for collaborative application usage
US11048488B2 (en) 2018-08-14 2021-06-29 Pegasystems, Inc. Software code optimizer and method
US11567945B1 (en) 2020-08-27 2023-01-31 Pegasystems Inc. Customized digital content generation systems and methods

Families Citing this family (95)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7386835B1 (en) * 2002-03-22 2008-06-10 Emc Corporation Technique for graphical user interface modification
US7181460B2 (en) * 2002-06-18 2007-02-20 International Business Machines Corporation User-defined aggregate functions in database systems without native support
US7412658B2 (en) * 2002-11-14 2008-08-12 Sap Ag Modeling system for graphic user interface
GB2403636A (en) * 2003-07-02 2005-01-05 Sony Uk Ltd Information retrieval using an array of nodes
WO2005015354A2 (en) * 2003-08-06 2005-02-17 Infotrend, Inc. Population analysis using linear displays
US7310780B2 (en) * 2003-08-14 2007-12-18 International Business Machines Corporation Methods, systems and computer program products for visually tethering related graphical objects
US20050057566A1 (en) * 2003-09-11 2005-03-17 International Business Machines Corporation Rich graphic visualization generation from abstract data representation
US7610549B2 (en) * 2004-05-20 2009-10-27 Sap Ag Method and system for Java Gantt/bar chart rendering
US7721197B2 (en) * 2004-08-12 2010-05-18 Microsoft Corporation System and method of displaying content on small screen computing devices
US20060033737A1 (en) * 2004-08-16 2006-02-16 Old William M Methods and system for visualizing data sets
US8001476B2 (en) 2004-11-16 2011-08-16 Open Text Inc. Cellular user interface
US8418075B2 (en) 2004-11-16 2013-04-09 Open Text Inc. Spatially driven content presentation in a cellular environment
US8239394B1 (en) 2005-03-31 2012-08-07 Google Inc. Bloom filters for query simulation
US7587387B2 (en) 2005-03-31 2009-09-08 Google Inc. User interface for facts query engine with snippets from information sources that include query terms and answer terms
US7953720B1 (en) 2005-03-31 2011-05-31 Google Inc. Selecting the best answer to a fact query from among a set of potential answers
US20060230083A1 (en) * 2005-04-08 2006-10-12 Microsoft Corporation Method and system for profiling and populating a database
CN104182459B (en) * 2005-12-01 2019-03-08 皇家飞利浦电子股份有限公司 System and method for content to be presented to the user
US7925676B2 (en) 2006-01-27 2011-04-12 Google Inc. Data object visualization using maps
US7778952B2 (en) * 2006-01-27 2010-08-17 Google, Inc. Displaying facts on a linear graph
US8954426B2 (en) 2006-02-17 2015-02-10 Google Inc. Query language
US20070185870A1 (en) 2006-01-27 2007-08-09 Hogue Andrew W Data object visualization using graphs
US8055674B2 (en) * 2006-02-17 2011-11-08 Google Inc. Annotation framework
US8015056B1 (en) * 2006-06-20 2011-09-06 Fenstermaker William H Method for visual presentation of key performance indicators of a business utilizing a squarified tree map
US7831928B1 (en) 2006-06-22 2010-11-09 Digg, Inc. Content visualization
US8583695B2 (en) * 2006-06-28 2013-11-12 Bank Of America Corporation Method and apparatus for evaluating statistical significance over time
CN100504868C (en) * 2006-06-30 2009-06-24 西门子(中国)有限公司 Tree structures list display process having multiple line content node and device thereof
US20080082559A1 (en) * 2006-09-28 2008-04-03 Gm Global Technology Operations, Inc. Method of linking information to an electronically enabled manufactured part archive
US8954412B1 (en) 2006-09-28 2015-02-10 Google Inc. Corroborating facts in electronic documents
WO2008043042A2 (en) * 2006-10-05 2008-04-10 Educational Testing Service Data structure for defining a chart
JP4296521B2 (en) * 2007-02-13 2009-07-15 ソニー株式会社 Display control apparatus, display control method, and program
US8250525B2 (en) 2007-03-02 2012-08-21 Pegasystems Inc. Proactive performance management for multi-user enterprise software systems
US8347202B1 (en) 2007-03-14 2013-01-01 Google Inc. Determining geographic locations for place names in a fact repository
US8910084B2 (en) * 2007-05-07 2014-12-09 Oracle International Corporation Aggregate layout for data visualization techniques
US8640056B2 (en) * 2007-07-05 2014-01-28 Oracle International Corporation Data visualization techniques
US8139063B2 (en) * 2007-05-07 2012-03-20 Oracle International Corporation Rendering data visualization with minimal round-off error
US9477732B2 (en) * 2007-05-23 2016-10-25 Oracle International Corporation Filtering for data visualization techniques
US8286100B2 (en) * 2007-07-05 2012-10-09 Oracle International Corporation Linking graphical elements of data visualizations
US8866815B2 (en) * 2007-05-23 2014-10-21 Oracle International Corporation Automated treemap configuration
US8239751B1 (en) 2007-05-16 2012-08-07 Google Inc. Data from web documents in a spreadsheet
US8090747B2 (en) * 2007-05-21 2012-01-03 New York University Method, system, computer-accessible medium and software arrangement for organization and analysis of multiple sets of data
US8786628B2 (en) * 2007-09-14 2014-07-22 Microsoft Corporation Rendering electronic chart objects
US9135331B2 (en) 2008-04-07 2015-09-15 Philip J. Rosenthal Interface including graphic representation of relationships between search results
US8306971B2 (en) * 2008-06-20 2012-11-06 Tableau Software, Inc. Methods and systems of automatically geocoding a dataset for visual analysis
US8620635B2 (en) 2008-06-27 2013-12-31 Microsoft Corporation Composition of analytics models
US8411085B2 (en) 2008-06-27 2013-04-02 Microsoft Corporation Constructing view compositions for domain-specific environments
US8539444B2 (en) * 2008-06-30 2013-09-17 International Business Machines Corporation System and method for platform-independent, script-based application generation for spreadsheet software
JP4636141B2 (en) * 2008-08-28 2011-02-23 ソニー株式会社 Information processing apparatus and method, and program
US8499284B2 (en) * 2008-09-11 2013-07-30 Microsoft Corporation Visualizing relationships among components using grouping information
US20100082385A1 (en) * 2008-10-01 2010-04-01 International Business Machines Corporation System and method for determining temperature of business components for finding business transformation opportunities
US8175911B2 (en) * 2008-10-01 2012-05-08 International Business Machines Corporation System and method for inferring and visualizing correlations of different business aspects for business transformation
US8145518B2 (en) * 2008-10-01 2012-03-27 International Business Machines Corporation System and method for finding business transformation opportunities by analyzing series of heat maps by dimension
US8359216B2 (en) * 2008-10-01 2013-01-22 International Business Machines Corporation System and method for finding business transformation opportunities by using a multi-dimensional shortfall analysis of an enterprise
US8176096B2 (en) * 2008-12-18 2012-05-08 Microsoft Corporation Data visualization interactivity architecture
US8314793B2 (en) 2008-12-24 2012-11-20 Microsoft Corporation Implied analytical reasoning and computation
US8638343B2 (en) * 2009-04-30 2014-01-28 Microsoft Corporation Data visualization platform performance optimization
US9250926B2 (en) * 2009-04-30 2016-02-02 Microsoft Technology Licensing, Llc Platform extensibility framework
US20100306708A1 (en) * 2009-05-29 2010-12-02 Rovi Techonologies Corporation Systems and methods for handling profiles in a community
US8531451B2 (en) 2009-06-19 2013-09-10 Microsoft Corporation Data-driven visualization transformation
US8692826B2 (en) 2009-06-19 2014-04-08 Brian C. Beckman Solver-based visualization framework
US9330503B2 (en) 2009-06-19 2016-05-03 Microsoft Technology Licensing, Llc Presaging and surfacing interactivity within data visualizations
US8866818B2 (en) 2009-06-19 2014-10-21 Microsoft Corporation Composing shapes and data series in geometries
US8493406B2 (en) 2009-06-19 2013-07-23 Microsoft Corporation Creating new charts and data visualizations
US8788574B2 (en) 2009-06-19 2014-07-22 Microsoft Corporation Data-driven visualization of pseudo-infinite scenes
US9396241B2 (en) 2009-07-15 2016-07-19 Oracle International Corporation User interface controls for specifying data hierarchies
US9087059B2 (en) 2009-08-07 2015-07-21 Google Inc. User interface for presenting search results for multiple regions of a visual query
US9135277B2 (en) 2009-08-07 2015-09-15 Google Inc. Architecture for responding to a visual query
US8352397B2 (en) 2009-09-10 2013-01-08 Microsoft Corporation Dependency graph in data-driven model
US8161374B2 (en) * 2009-10-23 2012-04-17 Microsoft Corporation Butterfly diagrams enabling multi-dimensional performance analysis
US8954893B2 (en) * 2009-11-06 2015-02-10 Hewlett-Packard Development Company, L.P. Visually representing a hierarchy of category nodes
US20110191699A1 (en) * 2010-02-02 2011-08-04 Dynavox Systems, Llc System and method of interfacing interactive content items and shared data variables
US9043296B2 (en) 2010-07-30 2015-05-26 Microsoft Technology Licensing, Llc System of providing suggestions based on accessible and contextual information
US20120102419A1 (en) * 2010-10-22 2012-04-26 Microsoft Corporation Representing data through a graphical object
US9202297B1 (en) 2011-07-12 2015-12-01 Domo, Inc. Dynamic expansion of data visualizations
US9792017B1 (en) * 2011-07-12 2017-10-17 Domo, Inc. Automatic creation of drill paths
US9501849B2 (en) * 2012-05-11 2016-11-22 Vmware, Inc. Multi-dimensional visualization tool for browsing and troubleshooting at scale
US20140282163A1 (en) * 2013-03-15 2014-09-18 Jock Douglas MacKinlay Tree Map
US9235564B2 (en) * 2013-07-19 2016-01-12 International Business Machines Corporation Offloading projection of fixed and variable length database columns
AU2014100875A4 (en) * 2013-08-06 2014-09-04 New Bis Safe Luxco S.À R.L Methods, apparatus and systems for data sorting, visualisation and related applications
US20150281292A1 (en) * 2014-03-25 2015-10-01 PlusAmp, Inc. Data File Discovery, Visualization, and Actioning
US9710430B2 (en) * 2014-05-09 2017-07-18 Sap Se Representation of datasets using view-specific visual bundlers
US10089372B2 (en) 2014-05-09 2018-10-02 Sap Se Data visualization using level of detail magnification
WO2015181897A1 (en) * 2014-05-27 2015-12-03 株式会社日立製作所 Management system for managing information system
WO2016065173A1 (en) * 2014-10-22 2016-04-28 Alibaba Group Holding Limited Method and apparatus for displaying statistical graphs on a terminal display
US10630557B2 (en) * 2015-10-19 2020-04-21 Nicira, Inc. Virtual network management
US10861202B1 (en) 2016-07-31 2020-12-08 Splunk Inc. Sankey graph visualization for machine data search and analysis system
US11037342B1 (en) * 2016-07-31 2021-06-15 Splunk Inc. Visualization modules for use within a framework for displaying interactive visualizations of event data
US10459938B1 (en) 2016-07-31 2019-10-29 Splunk Inc. Punchcard chart visualization for machine data search and analysis system
US10853380B1 (en) 2016-07-31 2020-12-01 Splunk Inc. Framework for displaying interactive visualizations of event data
US10459939B1 (en) 2016-07-31 2019-10-29 Splunk Inc. Parallel coordinates chart visualization for machine data search and analysis system
US9836183B1 (en) * 2016-09-14 2017-12-05 Quid, Inc. Summarized network graph for semantic similarity graphs of large corpora
US11068121B2 (en) * 2017-09-28 2021-07-20 Fujifilm Business Innovation Corp. System and method for visual exploration of subnetwork patterns in two-mode networks
CN110659736B (en) * 2019-08-01 2023-01-17 广东工业大学 Visual system for identifying evolution algorithm parameterized effect
CN110555005B (en) * 2019-08-12 2020-05-19 中国南方电网有限责任公司 Extensible open type paradigm method and device applied to power grid monitoring system
US11366826B2 (en) 2019-09-05 2022-06-21 International Business Machines Corporation Customizing data visualizations according to user activity
US11233705B2 (en) * 2019-12-23 2022-01-25 Robert Brennan Garvey System and method for visualizing and navigating network data

Family Cites Families (42)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5793369A (en) 1991-12-06 1998-08-11 Lucent Technologies Inc. Apparatus for visualizing program slices
US5847972A (en) 1993-09-24 1998-12-08 Eick; Stephen Gregory Method and apparatus for graphically analzying a log-file
US5765146A (en) * 1993-11-04 1998-06-09 International Business Machines Corporation Method of performing a parallel relational database query in a multiprocessor environment
US5689628A (en) 1994-04-14 1997-11-18 Xerox Corporation Coupling a display object to a viewpoint in a navigable workspace
US5608850A (en) 1994-04-14 1997-03-04 Xerox Corporation Transporting a display object coupled to a viewpoint within or between navigable workspaces
US5751931A (en) 1995-04-18 1998-05-12 Lucent Technologies Inc. Three dimensional graphical display of information
US6199082B1 (en) * 1995-07-17 2001-03-06 Microsoft Corporation Method for delivering separate design and content in a multimedia publishing system
US6430309B1 (en) * 1995-09-15 2002-08-06 Monogen, Inc. Specimen preview and inspection system
US6199098B1 (en) 1996-02-23 2001-03-06 Silicon Graphics, Inc. Method and apparatus for providing an expandable, hierarchical index in a hypertextual, client-server environment
US6023280A (en) 1996-02-28 2000-02-08 Lucent Technologies Inc. Calculation and visualization of tabular data
US5812134A (en) 1996-03-28 1998-09-22 Critical Thought, Inc. User interface navigational system & method for interactive representation of information contained within a database
US5999192A (en) * 1996-04-30 1999-12-07 Lucent Technologies Inc. Interactive data exploration apparatus and methods
US6014661A (en) 1996-05-06 2000-01-11 Ivee Development Ab System and method for automatic analysis of data bases and for user-controlled dynamic querying
US5831631A (en) 1996-06-27 1998-11-03 Intel Corporation Method and apparatus for improved information visualization
US5790121A (en) * 1996-09-06 1998-08-04 Sklar; Peter Clustering user interface
US6026409A (en) 1996-09-26 2000-02-15 Blumenthal; Joshua O. System and method for search and retrieval of digital information by making and scaled viewing
US5958008A (en) * 1996-10-15 1999-09-28 Mercury Interactive Corporation Software system and associated methods for scanning and mapping dynamically-generated web documents
US6034697A (en) 1997-01-13 2000-03-07 Silicon Graphics, Inc. Interpolation between relational tables for purposes of animating a data visualization
US5861891A (en) 1997-01-13 1999-01-19 Silicon Graphics, Inc. Method, system, and computer program for visually approximating scattered data
US6252597B1 (en) 1997-02-14 2001-06-26 Netscape Communications Corporation Scalable user interface for graphically representing hierarchical data
US6182058B1 (en) 1997-02-28 2001-01-30 Silicon Graphics, Inc. Bayes rule based and decision tree hybrid classifier
US5937064A (en) 1997-03-03 1999-08-10 Lucent Technologies, Inc. System and method for interactive visualization, analysis and control of a dynamic database
US6137499A (en) 1997-03-07 2000-10-24 Silicon Graphics, Inc. Method, system, and computer program product for visualizing data using partial hierarchies
US5960435A (en) 1997-03-11 1999-09-28 Silicon Graphics, Inc. Method, system, and computer program product for computing histogram aggregations
US6311196B1 (en) * 1997-04-09 2001-10-30 Rae Technology Inc. Method and apparatus for implementing web pages having master borders
US6075530A (en) 1997-04-17 2000-06-13 Maya Design Group Computer system and method for analyzing information using one or more visualization frames
US5930803A (en) 1997-04-30 1999-07-27 Silicon Graphics, Inc. Method, system, and computer program product for visualizing an evidence classifier
US6232984B1 (en) 1997-05-22 2001-05-15 Lucent Technologies, Inc. Data visualization system
US6026399A (en) 1997-05-30 2000-02-15 Silicon Graphics, Inc. System and method for selection of important attributes
JPH1165803A (en) 1997-08-22 1999-03-09 Nec Corp Information visualization system
US6236994B1 (en) 1997-10-21 2001-05-22 Xerox Corporation Method and apparatus for the integration of information and knowledge
US6154212A (en) 1997-11-06 2000-11-28 Lucent Technologies Inc. Method and apparatus for constructing network interfaces
US6223145B1 (en) 1997-11-26 2001-04-24 Zerox Corporation Interactive interface for specifying searches
US6151595A (en) 1998-04-17 2000-11-21 Xerox Corporation Methods for interactive visualization of spreading activation using time tubes and disk trees
US6742003B2 (en) * 2001-04-30 2004-05-25 Microsoft Corporation Apparatus and accompanying methods for visualizing clusters of data and hierarchical cluster classifications
US6300957B1 (en) 1998-07-29 2001-10-09 Inxight Software, Inc. Mapping a node-link structure to a rendering space beginning from any node
US6243093B1 (en) 1998-09-14 2001-06-05 Microsoft Corporation Methods, apparatus and data structures for providing a user interface, which exploits spatial memory in three-dimensions, to objects and which visually groups matching objects
US6496832B2 (en) 1998-10-20 2002-12-17 University Of Minnesota Visualization spreadsheet
US6629097B1 (en) * 1999-04-28 2003-09-30 Douglas K. Keith Displaying implicit associations among items in loosely-structured data sets
US6613098B1 (en) * 1999-06-15 2003-09-02 Microsoft Corporation Storage of application specific data in HTML
US6457045B1 (en) * 1999-08-30 2002-09-24 Zaplet, Inc. System and method for group choice making
US6473084B1 (en) * 1999-09-08 2002-10-29 C4Cast.Com, Inc. Prediction input

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
No Search *

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8959480B2 (en) 2004-05-26 2015-02-17 Pegasystems Inc. Methods and apparatus for integration of declarative rule-based processing with procedural programming in a digital data-processing environment
US9658735B2 (en) 2006-03-30 2017-05-23 Pegasystems Inc. Methods and apparatus for user interface optimization
US10838569B2 (en) 2006-03-30 2020-11-17 Pegasystems Inc. Method and apparatus for user interface non-conformance detection and correction
US10467200B1 (en) 2009-03-12 2019-11-05 Pegasystems, Inc. Techniques for dynamic data processing
US9678719B1 (en) 2009-03-30 2017-06-13 Pegasystems Inc. System and software for creation and modification of software
US8880487B1 (en) 2011-02-18 2014-11-04 Pegasystems Inc. Systems and methods for distributed rules processing
US9270743B2 (en) 2011-02-18 2016-02-23 Pegasystems Inc. Systems and methods for distributed rules processing
US10572236B2 (en) 2011-12-30 2020-02-25 Pegasystems, Inc. System and method for updating or modifying an application without manual coding
US10469396B2 (en) 2014-10-10 2019-11-05 Pegasystems, Inc. Event processing with enhanced throughput
US11057313B2 (en) 2014-10-10 2021-07-06 Pegasystems Inc. Event processing with enhanced throughput
US10698599B2 (en) 2016-06-03 2020-06-30 Pegasystems, Inc. Connecting graphical shapes using gestures
US10698647B2 (en) 2016-07-11 2020-06-30 Pegasystems Inc. Selective sharing for collaborative application usage
US11048488B2 (en) 2018-08-14 2021-06-29 Pegasystems, Inc. Software code optimizer and method
US11567945B1 (en) 2020-08-27 2023-01-31 Pegasystems Inc. Customized digital content generation systems and methods

Also Published As

Publication number Publication date
US6928436B2 (en) 2005-08-09
WO2003073319A8 (en) 2004-03-18
US20030167278A1 (en) 2003-09-04
AU2003210385A1 (en) 2003-09-09

Similar Documents

Publication Publication Date Title
US6928436B2 (en) Interactive generation of graphical visualizations of large data structures
EP1175661B1 (en) Computer method and apparatus for creating visible graphics by using a graph algebra
Keim Pixel-oriented visualization techniques for exploring very large data bases
Senay et al. A knowledge-based system for visualization design
US6750864B1 (en) Programs and methods for the display, analysis and manipulation of multi-dimensional data implemented on a computer
US7170519B2 (en) Computer-implemented system and method for generating data graphical displays
Keim Visual techniques for exploring databases
US8190620B2 (en) Apparatus and method for visualizing data within a decomposition graph
US7012602B2 (en) Virtual three-dimensional display for product development
WO2006060773A2 (en) Computer systems and methods for visualizing data with generation of marks
Mansmann et al. Exploring OLAP aggregates with hierarchical visualization techniques
Drucker et al. A unifying framework for animated and interactive unit visualizations
Wilson et al. Dynamic hierarchy specification and visualization
WO2001037120A2 (en) Programs and methods for the display, analysis and manipulation of multi-dimensional data
US6940509B1 (en) Systems and methods for improving concept landscape visualizations as a data analysis tool
EP0644500B1 (en) Method and system for producing a table image having focus and context areas showing direct and indirect data representations
Xie et al. Design and implementation of attribute database management system in a GIS system: GeoStar
Kaur et al. Pixel-Oriented Technique: A technique to visualize multidimensional data
Nation et al. Browse hierarchical data with the degree of interest tree
Kristiansen A Visual Language for Nested Visualization Design
Friedrichs et al. TILINGS AND SYMBOLS A REPORT ON THE USES OF SYMBOLIC CALCULATION IN TILING THEORY
Neumann Focus+ Context Visualization of Relations in Hierarchical Data
Stegeman User's Manual for Interactive Data Display System (IDDS)
Ruby Information Visualization for Financial Analysis
Al Faruque et al. A comparative Study: Multidimensional Visualization

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ OM PH PL PT RO RU SD SE SG SK SL TJ TM TN TR TT TZ UA UG UZ VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PT SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
D17 Declaration under article 17(2)a
122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase

Ref country code: JP

WWW Wipo information: withdrawn in national office

Country of ref document: JP