CN102968469A - Method and system for building application index, and method and system for application indexes - Google Patents

Method and system for building application index, and method and system for application indexes

Info

Publication number
CN102968469A
Authority
CN
China
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2012104492843A
Other languages
Chinese (zh)
Other versions
CN102968469B (en)
Inventor
方圆
金峰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Xiaomi Technology Co Ltd
Original Assignee
Beijing Xiaomi Technology Co Ltd
Application filed by Beijing Xiaomi Technology Co Ltd
Priority to CN201210449284.3A
Publication of CN102968469A
Application granted
Publication of CN102968469B
Legal status: Active

Abstract

An embodiment of the invention discloses a method and system for building an application index and a method and system for application search, aiming to solve the problem that applications cannot be located accurately. The index building method comprises: parsing the pages of an application to obtain page information of the application; converting the page information into structured information; and building an application index table corresponding to the application according to the structured information. Because the pages of the application are parsed in detail, the information inside the application is covered more completely by the application index table. Information inside the application can therefore be found at search time, which raises the search hit rate and locates the application a user needs more accurately.

Description

An application index building method and system, and an application search method and system
Technical field
The embodiments of the invention relate to the field of database technology, and in particular to an application index building method and system and an application search method and system.
Background
With the development of intelligent mobile terminals, the applications (Application, App) that run on them are becoming increasingly widespread. The future will be the era of the mobile Internet, in which an App will serve as an enterprise's identity card on the mobile Internet and as its stepping stone to winning users' future mobile terminals. However, as Apps continue to spread, isolated islands of information are gradually forming on the mobile Internet, which poses a great challenge for information retrieval.
When an App is uploaded to an App store, descriptive information about the App, such as its title, developer, and brief introduction, is uploaded with it, and this descriptive information serves as the search information for the App. When a user enters a keyword to search, an App is returned as a search result if its descriptive information contains the keyword.
However, this descriptive information cannot cover the full content of an App. If the keyword the user enters is inexact, the search is likely to miss the App the user actually needs. For example, a user who wants to find "fish-flavoured shredded pork" enters that keyword; if the descriptive information of the App the user really wants does not contain it, the search may return the wrong App or no App at all, so the App the user needs cannot be located accurately.
Summary of the invention
The embodiments of the invention provide an application index building method and system that can improve the search hit rate and locate the application a user needs more accurately.
Correspondingly, the embodiments of the invention also provide an application search method and system that use the application index table obtained as above.
To solve the above problem, an embodiment of the invention discloses a method for building an application database index, comprising:
parsing the pages of an application to obtain page information of the application;
converting the page information into structured information;
building an application index table corresponding to the application according to the structured information.
Preferably, the pages of the application contain a plurality of components;
parsing the pages of the application to obtain the page information of the application comprises:
reading the pages of the application according to the hierarchical structure of the pages;
collecting the page information of the pages, the page information including attribute information of the components in the pages.
Preferably, the components in a page include a parent component and at least one sub-component corresponding to the parent component;
the attribute information of the parent component includes the name of the parent component;
the attribute information of a sub-component includes the name of the sub-component, the descriptive information of the sub-component, and structural relationship information between the sub-component and the parent component and between the sub-component and the other sub-components in the same page.
Preferably, converting the page information into structured information comprises:
taking the components in the application as nodes;
determining the parent-child relationships and the sibling relationships between nodes according to the attribute information of the components, wherein the nodes corresponding to the sub-components in the same page are siblings;
generating tree structure information corresponding to the application from the nodes having parent-child and sibling relationships.
Preferably, building the application index table corresponding to the application according to the structured information comprises:
extracting index keyword information for the corresponding node from the name and descriptive information of each component;
building a hierarchically structured application index table corresponding to the application according to the index keyword information of the nodes and the relationships between the nodes.
In another aspect, an embodiment of the invention also discloses an application search method, comprising:
receiving an input index keyword;
looking up the index keyword in a pre-built application index table;
if it is found, obtaining the application corresponding to the index keyword and displaying the application;
wherein the application index table is one built by the method of any one of claims 1-5.
Preferably, looking up the index keyword in the pre-built application index table comprises:
traversing the application index table level by level according to its hierarchical structure and checking whether it contains the index keyword; if it does, the keyword is found; if it does not, the keyword is not found.
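The level-by-level lookup described above can be sketched as follows. This is a minimal illustration, not the patented implementation: the index table is assumed to be a nested dictionary with hypothetical keys `name`, `keywords`, and `children`, and the function name `search_index_table` is invented for this example.

```python
from collections import deque


def search_index_table(root, keyword):
    """Traverse a hierarchical index table level by level.

    `root` is an assumed node layout: {"name": str, "keywords": [str],
    "children": [node, ...]}. Returns the path of node names leading to the
    first node whose keywords contain `keyword`, or None if it is not found.
    """
    queue = deque([(root, [root["name"]])])
    while queue:
        node, path = queue.popleft()
        if keyword in node["keywords"]:
            return path  # found
        for child in node.get("children", []):
            queue.append((child, path + [child["name"]]))
    return None  # not found


# Hypothetical index table for the "homely menu" example application.
table = {
    "name": "homely menu",
    "keywords": ["homely menu"],
    "children": [
        {
            "name": "Sichuan cuisine",
            "keywords": ["Sichuan cuisine"],
            "children": [
                {"name": "Boiled beef", "keywords": ["Boiled beef"], "children": []},
            ],
        },
    ],
}
print(search_index_table(table, "Boiled beef"))
```

A queue gives a breadth-first order, so all nodes of one level are checked before any node of the next level, which matches the "level by level" wording.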
In another aspect, an embodiment of the invention also discloses an application index building system, comprising:
a page information acquisition module, configured to parse the pages of an application and obtain page information of the application;
a page information conversion module, configured to convert the page information into structured information;
an application index table building module, configured to build an application index table corresponding to the application according to the structured information.
Preferably, the pages of the application contain a plurality of components;
the page information acquisition module comprises:
a page reading submodule, configured to read the pages of the application according to the hierarchical structure of the pages;
a page information collection submodule, configured to collect the page information of the pages, the page information including attribute information of the components in the pages.
Preferably, the components in a page include a parent component and at least one sub-component corresponding to the parent component;
the attribute information of the parent component includes the name of the parent component;
the attribute information of a sub-component includes the name of the sub-component, the descriptive information of the sub-component, and structural relationship information between the sub-component and the parent component and between the sub-component and the other sub-components in the same page.
Preferably, the page information conversion module comprises:
a relationship determination submodule, configured to take the components in the application as nodes and determine the parent-child relationships and sibling relationships between nodes according to the attribute information of the components, wherein the nodes corresponding to the sub-components in the same page are siblings;
a structured information generation submodule, configured to generate tree structure information corresponding to the application from the nodes having parent-child and sibling relationships.
Preferably, the application index table building module comprises:
a keyword extraction submodule, configured to extract index keyword information for the corresponding node from the name and descriptive information of each component;
an application index table building submodule, configured to build a hierarchically structured application index table corresponding to the application according to the index keyword information of the nodes and the relationships between the nodes.
In another aspect, an embodiment of the invention also discloses an application search system, comprising:
a keyword receiving module, configured to receive an input index keyword;
a keyword lookup module, configured to look up the index keyword in a pre-built application index table;
an application acquisition module, configured to obtain the application corresponding to the index keyword and display the application when the keyword lookup module finds the index keyword;
wherein the application index table is one built by the system of any one of claims 8-12.
Preferably, the keyword lookup module comprises:
an application index table traversal submodule, configured to traverse the application index table level by level according to its hierarchical structure and check whether it contains the index keyword; if it does, the keyword is found; if it does not, the keyword is not found.
Compared with the background art, the embodiments of the invention have the following advantages:
The embodiments of the invention first parse the pages of an application to obtain its page information, then convert the page information into structured information, and build the application index table corresponding to the application according to that structured information. Parsing the pages in detail lets the application index table cover the information inside the application more completely, so that this information can be found at search time. This improves the search hit rate and locates the application the user needs more accurately.
Description of drawings
Fig. 1 is a flowchart of the application index building method proposed in Embodiment one of the invention;
Fig. 2 is a flowchart of the application index building method proposed in Embodiment two of the invention;
Fig. 3 is a schematic diagram of the page entered after clicking the "homely menu" icon in Embodiment two of the invention;
Fig. 4 is a schematic diagram of the page obtained after clicking the sub-component "Sichuan cuisine" in Embodiment two of the invention;
Fig. 5 is a schematic diagram of the page obtained after clicking the sub-component "Boiled beef" in Embodiment two of the invention;
Fig. 6 is a schematic diagram of the tree structure information converted from the page information of the page shown in Fig. 3;
Fig. 7 is a schematic diagram of the tree structure information converted from the page information of the page shown in Fig. 4;
Fig. 8 is a schematic diagram of the tree structure information converted from the page information of the page shown in Fig. 5;
Fig. 9 is a schematic diagram of the tree structure information corresponding to the application "homely menu" in Embodiment two of the invention;
Fig. 10 is a flowchart of the application search method proposed in Embodiment three of the invention;
Fig. 11 is a flowchart of the overall process of application index building and application search proposed in the embodiments of the invention;
Fig. 12 is a structural block diagram of the application index building system proposed in Embodiment four of the invention;
Fig. 13 is a structural block diagram of the application search system proposed in Embodiment five of the invention.
Detailed description
To make the above objects, features, and advantages of the embodiments of the invention clearer, the embodiments are described in further detail below with reference to the drawings and specific embodiments.
For each application, the embodiments of the invention parse its pages and then build an application index table from the parsed page information. Because this application index table contains more complete application information, the application a user needs can be located more accurately at search time.
Embodiment one:
Referring to Fig. 1, a flowchart of the application index building method proposed in Embodiment one of the invention is shown.
With the development of intelligent mobile terminals, the applications that run on them, such as QQ and PPS audio-visual, are becoming increasingly widespread, and users can search an application store for the applications they need. When an application is uploaded to the application store, descriptive information about it, such as its title, developer, and brief introduction, is uploaded with it and serves as the search information for the application. However, this descriptive information cannot cover the full content of the application. If the keyword the user enters is inexact and cannot be found in the application's descriptive information, the application the user needs is unlikely to be located accurately.
To address this problem, the embodiment of the invention proposes an application index building method, which can specifically comprise the following steps:
Step S101: parse the pages of the application to obtain the page information of the application.
In the embodiment of the invention, each application can contain a plurality of pages, and the page information of these pages reflects the content inside the application. The embodiment of the invention can parse each page of the application separately to obtain the page information of the application.
Step S102: convert the page information into structured information.
Here, structured information is information that, after analysis, can be decomposed into multiple interrelated parts with a clear hierarchy among them; its use and maintenance are managed through a database and follow certain operating rules. For example, the information managed by commonly encountered databases, including records of production, business, transactions, and customer information, is structured information.
Step S103: build the application index table corresponding to the application according to the structured information.
The detailed process of each of the above steps is described in the following embodiments and is not discussed further here.
By parsing the pages of the application in detail, the embodiment of the invention obtains the information of the application more completely and then builds the application index table from this information, so that the search hit rate is improved and the application the user needs is located more accurately.
Embodiment two:
The application index building method proposed by the embodiments of the invention is described in detail below.
Referring to Fig. 2, a flowchart of the application index building method proposed in Embodiment two of the invention is shown. The method can comprise:
Step S201: parse the pages of the application to obtain the page information of the application.
Each application can contain a plurality of pages, and the page information of these pages reflects the content inside the application. The embodiment of the invention can parse each page of the application separately to obtain the page information of the application.
Specifically, the application to be parsed can be run on a virtual runtime platform, and its pages parsed while the application is running.
In the embodiments of the invention, the pages of an application are hierarchically structured; each layer can contain one or more pages, and each page contains a plurality of components. This embodiment analyzes these components and creates the application index table corresponding to the application from their information.
Step S201 can specifically comprise:
Step a1: read the pages of the application according to the hierarchical structure of the pages.
In this embodiment, each page of the application can be read according to the hierarchical structure of the pages, for example starting from the first-layer pages and reading the pages of each layer in turn. The pages can of course be read in other ways; this embodiment places no limit on this.
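Reading pages layer by layer can be sketched as a breadth-first walk. The sketch below is illustrative only: it assumes a hypothetical mapping `next_pages` from a page name to the names of the pages reachable by clicking its sub-components, and it assumes page names are unique; neither assumption comes from the source.

```python
def read_pages_by_layer(first_page, next_pages):
    """Return the pages of an application grouped by layer.

    `next_pages` is an assumed mapping: page name -> list of page names
    reached by clicking that page's sub-components.
    """
    layers = []
    current = [first_page]
    seen = {first_page}
    while current:
        layers.append(current)
        following = []
        for page in current:
            for child in next_pages.get(page, []):
                if child not in seen:  # avoid reading a page twice
                    seen.add(child)
                    following.append(child)
        current = following
    return layers


# Hypothetical navigation data for the "homely menu" example.
next_pages = {
    "homely menu": ["Sichuan cuisine", "Guangdong dishes"],
    "Sichuan cuisine": ["Boiled beef"],
}
for depth, pages in enumerate(read_pages_by_layer("homely menu", next_pages), start=1):
    print(depth, pages)
```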
Here, a hierarchical structure is the stepwise structural relationship in which several elements form a system through interrelations, and such systems in turn form new systems through new interrelations. For example, an ecosystem can be divided, by the commonality of its elements' characteristics, modes of connection, and functions, and by scale and range of energy variation, into 11 levels: global (biosphere), regional (biome), landscape, ecosystem, community, population, individual, tissue, cell, gene, and molecule, where lower-level processes are contained in higher-level behavior.
Step a2: collect the page information of the pages, the page information including the attribute information of the components in the pages.
Because a page contains a plurality of components, the page information of each page can include the attribute information of each of its components.
Specifically, the page information of a page can be collected while the page is being read, or the page information of all pages can be collected after the pages of the application have been read; this embodiment places no limit on this.
The acquisition of page information in step S201 can be carried out by a program. In this embodiment, the page information can be collected with Hierarchy Viewer, a tool that ships with the system; the page information can of course also be obtained in other ways.
Each page contains a plurality of components. The page information of a page can include the attribute information of each component and can also include other information, for example page layout information, which can specifically be the components contained in the page.
The components in a page can include a parent component and at least one sub-component corresponding to that parent component. The attribute information of the parent component can include the name of the parent component. The attribute information of a sub-component can include the name of the sub-component, the descriptive information of the sub-component, and structural relationship information between the sub-component and the parent component and between the sub-component and the other sub-components in the same page. Specifically, the structural relationship between a sub-component and the parent component is a parent-child relationship, and the structural relationship between a sub-component and the other sub-components in the same page is a sibling relationship.
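One possible in-memory shape for this attribute information is sketched below. The class `ComponentInfo`, its field names, and the helper `page_info` are all hypothetical choices made for illustration; the source only lists which pieces of information are collected, not how they are stored.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class ComponentInfo:
    """Attribute information of one component in a page (assumed layout)."""
    name: str
    description: str = ""                               # parent components have none
    parent: Optional[str] = None                        # parent-child relationship
    siblings: List[str] = field(default_factory=list)   # sibling relationships


def page_info(parent_name: str, children: List[Tuple[str, str]]) -> List[ComponentInfo]:
    """Build the page information for a page with one parent component.

    `children` is a list of (name, description) pairs for the sub-components.
    """
    names = [name for name, _ in children]
    infos = [ComponentInfo(name=parent_name)]
    for name, description in children:
        others = [n for n in names if n != name]
        infos.append(ComponentInfo(name, description, parent_name, others))
    return infos


# The last-layer page of the "homely menu" example.
for info in page_info("Boiled beef", [("material", "beef 500 grams ..."), ("way", "")]):
    print(info)
```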
Below, the application "homely menu" is used as an example for detailed description.
Fig. 3 is a schematic diagram of a page in the first layer of "homely menu"; this is the page the user enters after clicking the "homely menu" icon.
Fig. 3 shows the page information of this page. As the figure shows, the page contains 5 components: 1 parent component and 4 sub-components.
The parent component is named "homely menu" and has no descriptive information. The sub-components are named "Sichuan cuisine", "Guangdong dishes", "Zhejiang dish", and "Shanghai dish"; the text below each sub-component's name is its descriptive information, for example the descriptive information of the sub-component "Sichuan cuisine" is "has long had the reputation that each dish has its own flavour ...". Each of the 4 sub-components "Sichuan cuisine", "Guangdong dishes", "Zhejiang dish", and "Shanghai dish" has a parent-child relationship with the parent component "homely menu", and the 4 sub-components are siblings of one another.
The page can of course also contain other components, such as "Shandong cuisine", "Fujian cuisine", and "Hunan cuisine"; the embodiments of the invention place no limit on this.
In this page, the parent component cannot be clicked, while every sub-component can. Clicking a sub-component opens the next-layer page corresponding to that sub-component. In other words, the layer below this page can contain a plurality of pages, each obtained by clicking one of the sub-components in this page.
Fig. 4 is a schematic diagram of the page obtained after clicking the sub-component "Sichuan cuisine". As the figure shows, this page also contains 1 parent component and 4 sub-components.
The parent component is named "Sichuan cuisine" and has no descriptive information. The sub-components are named "the Fish with Chinese Sauerkraut", "Boiled beef", "river pickled cucumber with chilli chilli sauce", and "Spicy diced chicken with peanuts"; the text below each sub-component's name is its descriptive information, for example the descriptive information of the sub-component "the Fish with Chinese Sauerkraut" is "grass carp 600 grams, pickled sauerkraut 100 grams ...". Each of the 4 sub-components "the Fish with Chinese Sauerkraut", "Boiled beef", "river pickled cucumber with chilli chilli sauce", and "Spicy diced chicken with peanuts" has a parent-child relationship with the parent component "Sichuan cuisine", and the 4 sub-components are siblings of one another.
Likewise, in this page the parent component "Sichuan cuisine" cannot be clicked, while every sub-component can; clicking a sub-component opens the next-layer page corresponding to it.
Fig. 5 is a schematic diagram of the page obtained after clicking the sub-component "Boiled beef". As the figure shows, this page contains 1 parent component and 2 sub-components.
The parent component is named "Boiled beef" and has no descriptive information. The sub-components are named "material" and "way"; the text below each sub-component's name is its descriptive information, for example the descriptive information of the sub-component "material" is "beef 500 grams ...". The sub-components "material" and "way" each have a parent-child relationship with the parent component "Boiled beef", and the 2 sub-components are siblings of each other.
In this embodiment, this page is a last-layer page of "homely menu", that is, none of the components in this page can be clicked.
Note that the application "homely menu" described above is only an example. Each page can also contain other page information, and other applications can also be processed with the method proposed by the embodiments of the invention. For example, an application about tourism can arrange its pages according to the hierarchy of country, province, city, county, and so on; this embodiment does not discuss this in detail.
Step S202: convert the page information into structured information.
To build the application index table corresponding to the application, after the page information of the application has been collected in step S201, the page information can be converted into structured information, and the application index table is then built from the structured information.
Specifically, converting the page information into structured information can comprise:
Step b1: take the components in the application as nodes.
Step b2: determine the parent-child relationships and the sibling relationships between nodes according to the attribute information of the components, wherein the nodes corresponding to the sub-components in the same page are siblings.
Step b3: generate the tree structure information corresponding to the application from the nodes having parent-child and sibling relationships.
The embodiments of the invention can convert the page information of the application into tree structure information. Each component in the application is taken as a node; for the nodes in a page, the parent-child and sibling relationships between nodes are determined from the attribute information of the nodes, and the corresponding tree structure information is generated from these nodes.
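Steps b1 to b3 can be sketched for a single page as follows. This is a minimal illustration under stated assumptions: each component is given as a dictionary with hypothetical keys `name`, `description`, and `parent`, component names are assumed unique within the page, and the function name `build_page_tree` is invented here.

```python
def build_page_tree(components):
    """Turn the component attribute records of one page into a tree.

    Each record is an assumed dict: {"name": str, "description": str,
    "parent": str or None}. Nodes that share a parent end up in the same
    `children` list, which is how the sibling relationship is represented.
    """
    # Step b1: every component becomes a node.
    nodes = {
        c["name"]: {"name": c["name"], "description": c.get("description", ""), "children": []}
        for c in components
    }
    # Steps b2 and b3: link each node to its parent to form the tree.
    root = None
    for c in components:
        parent = c.get("parent")
        if parent is None:
            root = nodes[c["name"]]
        else:
            nodes[parent]["children"].append(nodes[c["name"]])
    return root


# Component records for the page shown in Fig. 3 (descriptions shortened).
fig3 = [
    {"name": "homely menu", "parent": None},
    {"name": "Sichuan cuisine", "description": "...", "parent": "homely menu"},
    {"name": "Guangdong dishes", "description": "...", "parent": "homely menu"},
    {"name": "Zhejiang dish", "description": "...", "parent": "homely menu"},
    {"name": "Shanghai dish", "description": "...", "parent": "homely menu"},
]
tree = build_page_tree(fig3)
print([child["name"] for child in tree["children"]])
```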
Note that, in the embodiments of the invention, page information can be converted into tree structure information while it is being collected, or the conversion can be done after all page information has been collected; this embodiment places no limit on this.
Continuing with the "homely menu" example, the three pages shown in Figs. 3 to 5 are analyzed separately to obtain the tree structure information corresponding to each page.
Fig. 6 is a schematic diagram of the tree structure information converted from the page information of the page shown in Fig. 3.
From the above analysis, the page shown in Fig. 3 contains 5 nodes: the parent node "homely menu" and the child nodes "Sichuan cuisine", "Guangdong dishes", "Zhejiang dish", and "Shanghai dish". As Fig. 6 shows, in the tree structure information corresponding to this page the first layer is the parent node "homely menu", and the second layer contains the node "homely menu" together with the child nodes "Sichuan cuisine", "Guangdong dishes", "Zhejiang dish", and "Shanghai dish" (the information of each child node also includes the node's descriptive information, which is not written out in the figure). The child nodes are siblings of one another, and other child nodes can of course also be included.
Note that although the second layer contains a node "homely menu", that node does not have a parent-child relationship with the first-layer node "homely menu". It is shown only to indicate that the nodes in the second layer belong to the same page, and the second-layer node "homely menu" has no child nodes.
Fig. 7 is a schematic diagram of the tree structure information converted from the page information of the page shown in Fig. 4.
From the above analysis, the page shown in Fig. 4 is obtained by clicking "Sichuan cuisine" in Fig. 3 and contains 5 nodes: the parent node "Sichuan cuisine" and the child nodes "the Fish with Chinese Sauerkraut", "Boiled beef", "river pickled cucumber with chilli chilli sauce", and "Spicy diced chicken with peanuts". As Fig. 7 shows, in the tree structure information corresponding to this page the first layer is the parent node "Sichuan cuisine", and the second layer contains the node "Sichuan cuisine" together with the child nodes "the Fish with Chinese Sauerkraut", "Boiled beef", "river pickled cucumber with chilli chilli sauce", and "Spicy diced chicken with peanuts". The child nodes are siblings of one another, and other child nodes can of course also be included.
Fig. 8 is a schematic diagram of the tree structure information converted from the page information of the page shown in Fig. 5.
The page shown in Fig. 5 is obtained by clicking "Boiled beef" in Fig. 4. The page shown in Fig. 5 contains 3 nodes: the parent node "Boiled beef" and the child nodes "material" and "way". As Fig. 8 shows, in the tree structure information corresponding to this page the first layer is the parent node "Boiled beef", and the second layer contains the node "Boiled beef" together with the child nodes "material" and "way". The child nodes are siblings of each other, and other child nodes can of course also be included.
By analyzing the tree structure information of each of the above pages, the tree structure information corresponding to the whole application can finally be generated, as shown in Fig. 9. Fig. 9 does not contain the complete structural information of the application; for example, the child nodes of "Guangdong dishes" are not shown. Those skilled in the art can handle the specific information according to the actual situation, and this embodiment does not discuss it in detail.
Finally, note that in the above tree structure information each node is related only to the nodes inside the page where it is located. For example, "the Fish with Chinese Sauerkraut" may also appear under Hunan cuisine, but within the "Sichuan cuisine" branch "the Fish with Chinese Sauerkraut" is only a child node of "Sichuan cuisine".
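One way to keep same-named nodes in different branches separate when the per-page trees are combined is to identify each page by the click path that leads to it rather than by its name alone. The sketch below assumes a hypothetical mapping `pages` from a click path (a tuple of names) to the sub-component names on the page reached by that path; the source does not prescribe this representation.

```python
def merge_page_trees(pages, path):
    """Combine per-page information into one application tree.

    `pages` is an assumed mapping: click path (tuple of component names)
    -> list of sub-component names on the page reached by that path.
    Keying by path keeps identically named nodes in different branches apart.
    """
    return {
        "name": path[-1],
        "children": [
            merge_page_trees(pages, path + (child,))
            for child in pages.get(path, [])
        ],
    }


# Hypothetical data: the same dish name appears under two cuisines.
pages = {
    ("homely menu",): ["Sichuan cuisine", "Hunan cuisine"],
    ("homely menu", "Sichuan cuisine"): ["the Fish with Chinese Sauerkraut", "Boiled beef"],
    ("homely menu", "Hunan cuisine"): ["the Fish with Chinese Sauerkraut"],
    ("homely menu", "Sichuan cuisine", "Boiled beef"): ["material", "way"],
}
app_tree = merge_page_trees(pages, ("homely menu",))
print([child["name"] for child in app_tree["children"]])
```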
Step S203: build the application index table corresponding to the application according to the structured information.
After the structured information of the application has been obtained, the application index table corresponding to the application can be built from it.
In the embodiments of the invention, for each application, the corresponding application index table can be built from the structured information of that application.
This step can specifically comprise:
Step c1: extract the index keyword information of the corresponding node from the name and descriptive information of each component.
Step c2: build a hierarchically structured application index table corresponding to the application according to the index keyword information of the nodes and the relationships between the nodes.
For each assembly in the page of using, can from the title of this assembly and descriptor, extract the indexing key words information of corresponding node, concrete, the process of extracting indexing key words information can comprise:
(1) analyze the name and description information of the component to obtain the part of speech of each word in them;
The parts of speech may include: nouns, verbs, adjectives, numerals, measure words, pronouns, adverbs, prepositions, conjunctions, auxiliary words, onomatopoeia, interjections, and so on.
(2) segment the name and description information of the component into words according to the parts of speech;
(3) delete the less important words after segmentation, for example auxiliary words and interjections;
(4) use one or more of the remaining words as the above index keyword information.
Other methods can of course also be used to extract index keywords; the embodiments of the invention do not limit this.
For example, for the component named "Sichuan cuisine", the component name "Sichuan cuisine" can be used as the index keyword information of the corresponding node. One or more words from its description information, such as "stir-fry", "pan-fry", or "dry-braise", can also be used as index keywords of the corresponding node; the embodiments of the invention do not limit this.
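The four extraction steps above can be sketched as follows. This is a toy illustration only: the lexicon `POS_LEXICON`, the function `extract_index_keywords`, and the sample description text are hypothetical, whitespace splitting stands in for real word segmentation, and unknown words are assumed to be nouns. A practical system would rely on a proper segmenter and part-of-speech tagger.

```python
# Toy part-of-speech lexicon (hypothetical); unknown words default to "noun".
POS_LEXICON = {
    "sichuan": "noun",
    "cuisine": "noun",
    "dishes": "noun",
    "spicy": "adjective",
    "stir-fry": "verb",
    "the": "auxiliary",
    "oh": "interjection",
}

# Parts of speech treated as low importance, following step (3).
LOW_IMPORTANCE = {"auxiliary", "interjection"}


def extract_index_keywords(name, description, max_keywords=5):
    """Return up to max_keywords index keywords for one component."""
    text = f"{name} {description}".lower().replace(",", " ")
    # Steps (1) and (2): split into words and look up each word's part of speech.
    tagged = [(word, POS_LEXICON.get(word, "noun")) for word in text.split()]
    # Step (3): delete the less important words.
    kept = [word for word, pos in tagged if pos not in LOW_IMPORTANCE]
    # Step (4): keep one or more of the remaining words, without duplicates.
    keywords = []
    for word in kept:
        if word not in keywords:
            keywords.append(word)
    return keywords[:max_keywords]


keywords = extract_index_keywords("Sichuan cuisine", "oh the spicy stir-fry dishes")
```

Here the interjection "oh" and the auxiliary word "the" are dropped, and the remaining words become the candidate index keywords of the node.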
In the embodiments of the invention, the above application index table can be built according to the hierarchical structure of the tree-structure information corresponding to the application. Other forms of index table, such as an inverted index table, can of course also be built; this embodiment does not limit this.
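To make the two forms of index table concrete, the sketch below builds a layered index table that follows the hierarchy of the tree, and then flattens it into an inverted index. The nested-dictionary tree `TREE` and the functions `build_layered_index` and `build_inverted_index` are assumptions made for this example, and each node name is used directly as its index keyword.

```python
from collections import defaultdict

# Hypothetical tree-structure information: node name -> child nodes.
TREE = {
    "homely menu": {
        "Sichuan cuisine": {"Boiled beef": {"material": {}, "way": {}}},
        "Guangdong dishes": {},
    }
}


def build_layered_index(tree, app_name):
    """Return one dict per tree layer, mapping keyword -> set of application names."""
    layers = []
    frontier = [tree]
    while frontier:
        layer = defaultdict(set)
        next_frontier = []
        for subtree in frontier:
            for keyword, children in subtree.items():
                layer[keyword].add(app_name)
                if children:
                    next_frontier.append(children)
        layers.append(dict(layer))
        frontier = next_frontier
    return layers


def build_inverted_index(layers):
    """Flatten the layers into a single keyword -> application names mapping."""
    inverted = defaultdict(set)
    for layer in layers:
        for keyword, apps in layer.items():
            inverted[keyword] |= apps
    return dict(inverted)


layers = build_layered_index(TREE, "homely menu")
inverted = build_inverted_index(layers)
```

The layered form preserves the position of each keyword in the hierarchy, whereas the inverted form trades that information for a direct keyword lookup.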
Step S204: generate the application database according to the application index table.
After the application index tables corresponding to the applications are built, the application database can be generated from these application index tables. The application database can of course also contain other data related to the applications. Those skilled in the art can carry out the specific process of generating the application database according to practical experience, and the embodiments of the invention do not discuss it further here.
It should be noted that, in the embodiments of the invention, the above application database may come from the following sources:
(1) the index table is built on the retrieval server, and the application database is generated;
(2) the index table is built on the client and uploaded to the retrieval server, and the application database is generated;
(3) the index table is built by a third party, and the application database is generated.
The application database can of course also come from other sources; the embodiments of the invention do not limit this.
Embodiment three:
For the application index table built above, the embodiments of the invention also propose an application retrieval method based on this application index table.
Figure 10 is a flowchart of an application retrieval method proposed by embodiment three of the invention. The method may specifically include:
Step S1001: receive the input index keyword.
In this embodiment, a search box can be provided in the application store. To search for an application, the user can enter the index keyword directly in the search box, either by keyboard or by handwriting; this embodiment does not limit this.
Step S1002: look up the above index keyword in the pre-built application index table.
After the index keyword entered by the user is received, it can be looked up in the application index table created in embodiment two above.
This step may specifically include:
traversing the application index table level by level according to its hierarchical structure, and checking whether the application index table contains the above index keyword; if it does, the keyword is found; if it does not, the keyword is not found.
Step S1003: if the keyword is found, obtain the application corresponding to the index keyword and display the application.
Taking the application "homely menu" from embodiment two above as an example again, suppose that the index keyword corresponding to each node is the name of the corresponding component, and that the application index table is built in the same form as the hierarchical structure of the tree-structure information of the application.
For example, if the index keyword entered by the user is "Boiled beef", the application index table can be traversed level by level from top to bottom according to its hierarchical structure. The first layer of the application index table is searched first; the keyword is not found there, so the second layer is searched next; finally, the index keyword is found in the third layer of the application index table. Once the index keyword is found, the corresponding application "homely menu" is obtained and displayed to the user, who can choose to download it or perform other operations.
If the index keyword is not found in the application index table, the prompt "no corresponding application found" is displayed and the user is prompted to enter another index keyword. The lookup result can of course also be reported in other ways; the embodiments of the invention do not limit this.
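The level-by-level lookup in this example can be sketched as below. The layered table `INDEX_TABLE` and the functions `find_applications` and `search` are hypothetical; the table assumes one dictionary per layer and uses node names as index keywords, as supposed in the example above.

```python
# Hypothetical layered application index table: keyword -> application names.
INDEX_TABLE = [
    {"homely menu": {"homely menu"}},
    {"Sichuan cuisine": {"homely menu"}, "Guangdong dishes": {"homely menu"}},
    {"Boiled beef": {"homely menu"}, "Fish with Chinese Sauerkraut": {"homely menu"}},
]


def find_applications(index_table, keyword):
    """Search layer by layer, top down; return (layer number, applications)."""
    for layer_number, layer in enumerate(index_table, start=1):
        if keyword in layer:
            return layer_number, set(layer[keyword])
    return None, set()


def search(index_table, keyword):
    """Return the text to display for a keyword entered in the search box."""
    layer_number, apps = find_applications(index_table, keyword)
    if layer_number is None:
        return "no corresponding application found"
    return ", ".join(sorted(apps))
```

This sketch stops at the first layer that contains the keyword. An implementation that must return every matching application would instead continue through the remaining layers and merge the results.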
It should be noted that the above description is only an example. If the application index table in this embodiment takes another structural form, those skilled in the art can handle it accordingly, and the embodiments of the invention do not discuss this further here.
The application retrieval method proposed by the embodiments of the invention is based on the application index table built in embodiment two above. Because this application index table is obtained by parsing the pages of the application in detail, it contains more complete information about the inside of the application. When a user searches, the information inside the application can therefore be retrieved, which improves the hit rate of retrieval and locates the application required by the user more accurately.
Embodiments one and two above describe the process of building the application index, and embodiment three describes the process of retrieval using the application index table built in embodiment one or embodiment two. Drawing on the description above, the overall process of building the application index table and retrieving applications proposed by the embodiments of the invention is introduced below. As shown in Figure 11, the process includes:
1. Run each application (APP1, APP2, ...) on a virtual operation platform.
2. Perform APP information parsing while the applications run; during parsing, APP internal node structure parsing and APP internal node detail parsing are carried out continuously.
3. Convert the parsed application information into APP internal structured information (APP1 internal structured information, APP2 internal structured information, ...).
4. Build an application index table (Indexer) from the internal structured information of each APP.
5. Retrieve the application required by the user in the application index table built above.
For details of each of the above processes, refer to the related descriptions in the embodiments above; they are not described in detail here.
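The five stages of Figure 11 can be strung together in a short sketch. Everything here is illustrative: `PARSED_PAGES` stands in for the output of stages 1 and 2 (no application is actually run or parsed), the second application "weather" and its nodes are invented sample data, and the index is a simple flat keyword table rather than the hierarchical table described earlier.

```python
from collections import defaultdict

# Stand-in for stages 1 and 2: per application, one (parent, children) pair per page.
PARSED_PAGES = {
    "APP1": [("homely menu", ["Sichuan cuisine"]), ("Sichuan cuisine", ["Boiled beef"])],
    "APP2": [("weather", ["today", "forecast"])],
}


def to_structured(pages):
    """Stage 3: convert page information into parent -> children structured information."""
    structure = defaultdict(list)
    for parent, children in pages:
        structure[parent].extend(children)
    return dict(structure)


def build_index(structured_by_app):
    """Stage 4: map each node name (used as its keyword) to the applications containing it."""
    index = defaultdict(set)
    for app, structure in structured_by_app.items():
        for parent, children in structure.items():
            for keyword in [parent, *children]:
                index[keyword].add(app)
    return dict(index)


def retrieve(index, keyword):
    """Stage 5: return the applications matching the keyword, sorted for display."""
    return sorted(index.get(keyword, ()))


structured = {app: to_structured(pages) for app, pages in PARSED_PAGES.items()}
index = build_index(structured)
```

With this sample data, a keyword that only appears inside an application, such as "Boiled beef", still leads back to the application that contains it.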
By parsing the pages of the application in detail, the embodiments of the invention obtain the information of the application more completely and then build the application index table from this information, so that retrieval has a higher hit rate and locates the application required by the user more accurately.
It should be noted that, for simplicity of description, the foregoing method embodiments are all expressed as a series of combined actions. Those skilled in the art should understand, however, that the embodiments of the invention are not limited by the described order of actions, because according to the embodiments of the invention some steps can be performed in other orders or simultaneously. Those skilled in the art should also understand that the embodiments described in the specification are all preferred embodiments, and the actions involved are not necessarily required by the embodiments of the invention.
Embodiment four:
Figure 12 shows a structural block diagram of an application index building system proposed by embodiment four of the invention. The system includes: a page information acquisition module 121, a page information conversion module 122, and an application index table building module 123.
Wherein:
The page information acquisition module 121 is configured to parse the pages of the application and obtain the page information of the application;
In the embodiments of the invention, the application can contain a plurality of pages, and each page can be parsed separately. The pages of the application can be pages with a hierarchical structure, and a page of the application contains a plurality of components.
The page information acquisition module 121 may specifically include:
a page reading submodule, configured to read the pages of the application according to the hierarchical structure of the pages;
a page information collection submodule, configured to collect the page information of the page, where the page information includes the attribute information of the components in the page.
In this embodiment, the components in the page include a parent component and at least one sub-component corresponding to the parent component. The attribute information of the parent component includes the name of the parent component. The attribute information of a sub-component includes the name of the sub-component, the description information of the sub-component, and the structural relationship information between the sub-component and the parent component and between the sub-component and the other sub-components in the same page.
The page information conversion module 122 is configured to convert the page information into structured information;
The page information conversion module 122 may specifically include:
a relation determining submodule, configured to take the components in the application as nodes and determine the parent-child relationships and sibling relationships between the nodes according to the attribute information of the components, where the nodes corresponding to the sub-components in the same page are siblings of one another;
a structured information generating submodule, configured to generate the tree-structure information corresponding to the application from the nodes having parent-child and sibling relationships.
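As one possible reading of the relation determining submodule, the sketch below derives parent-child pairs and sibling groups from flat component attribute records. The record type `ComponentInfo`, its fields, the page identifier "figure-5", and the sample description strings are all hypothetical; the embodiment does not prescribe a concrete data format.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple


@dataclass(frozen=True)
class ComponentInfo:
    """Attribute information collected for one component (hypothetical layout)."""
    name: str
    description: str
    parent: Optional[str]  # name of the parent component; None for the parent component itself
    page: str              # identifier of the page containing the component


def determine_relations(components: List[ComponentInfo]):
    """Return (parent-child pairs, sibling groups) derived from the attribute information."""
    parent_child: List[Tuple[str, str]] = []
    by_page: Dict[Tuple[str, str], List[str]] = {}
    for component in components:
        if component.parent is None:
            continue  # the parent component of the page has no parent within that page
        parent_child.append((component.parent, component.name))
        by_page.setdefault((component.page, component.parent), []).append(component.name)
    # Sub-components of the same page are siblings of one another.
    siblings = [names for names in by_page.values() if len(names) > 1]
    return parent_child, siblings


# Sample records for the page of Figure 5; the descriptions are invented.
page_5 = [
    ComponentInfo("Boiled beef", "", None, "figure-5"),
    ComponentInfo("material", "what the dish is made from", "Boiled beef", "figure-5"),
    ComponentInfo("way", "how the dish is cooked", "Boiled beef", "figure-5"),
]
parent_child, siblings = determine_relations(page_5)
```

The resulting pairs and groups are exactly what the structured information generating submodule would need in order to assemble the tree-structure information.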
The application index table building module 123 is configured to build the application index table corresponding to the application according to the structured information.
The application index table building module 123 may specifically include:
a keyword extraction submodule, configured to extract the index keyword information of the corresponding node from the name and description information of the component;
an application index table building submodule, configured to build the hierarchical application index table corresponding to the application according to the index keyword information of the nodes and the relationships between the nodes.
Embodiment five:
For the application index table built by the above system, the embodiments of the invention also propose an application retrieval system.
Figure 13 is a structural block diagram of an application retrieval system proposed by embodiment five of the invention. The system includes a keyword receiving module 131, a keyword lookup module 132, and an application acquisition module 133.
Wherein:
The keyword receiving module 131 is configured to receive the input index keyword;
The keyword lookup module 132 is configured to look up the index keyword in the pre-built application index table;
The keyword lookup module 132 may specifically include:
an application index table traversal submodule, configured to traverse the application index table level by level according to its hierarchical structure and check whether the application index table contains the above index keyword; if it does, the keyword is found; if it does not, the keyword is not found.
The application acquisition module 133 is configured to obtain the application corresponding to the index keyword when the keyword lookup module finds the above index keyword, and to display the application.
The application retrieval system proposed by the embodiments of the invention is based on the application index table built by the system of embodiment four above. Because this application index table is obtained by parsing the pages of the application in detail, it contains more complete information about the inside of the application. When a user searches, the information inside the application can therefore be retrieved, which improves the hit rate of retrieval and locates the application required by the user more accurately.
Because the system embodiments are basically similar to the method embodiments, they are described relatively briefly; for the relevant parts, refer to the corresponding description of the method embodiments.
The embodiments in this specification are described in a progressive manner. Each embodiment focuses on its differences from the other embodiments, and the identical or similar parts of the embodiments can be referred to one another.
Those skilled in the art will readily appreciate that any combination of the above embodiments is feasible, so any combination of the above embodiments is also an embodiment of the invention; for reasons of space, this specification does not describe each combination in detail.
The embodiments of the invention can be described in the general context of computer-executable instructions executed by a computer, for example program modules. In general, program modules include routines, programs, objects, components, data structures, and the like that perform particular tasks or implement particular abstract data types. The invention can also be practiced in distributed computing environments, in which tasks are performed by remote processing devices connected through a communication network. In a distributed computing environment, program modules can be located in both local and remote computer storage media, including storage devices.
Finally, it should also be noted that, in this document, relational terms such as "first" and "second" are used only to distinguish one entity or operation from another, and do not necessarily require or imply any such actual relationship or order between these entities or operations. Moreover, the terms "comprise", "include", and any other variants thereof are intended to cover non-exclusive inclusion, so that a process, method, article, or device comprising a series of elements includes not only those elements but also other elements not expressly listed, or elements inherent to such a process, method, article, or device. Without further limitation, an element defined by the phrase "comprising a ..." does not exclude the presence of other identical elements in the process, method, article, or device that comprises the element.
The application index building method and system and the application retrieval method and system provided by the embodiments of the invention have been described in detail above. Specific examples are used herein to explain the principles and implementations of the invention, and the description of the above embodiments is intended only to help in understanding the method of the invention and its core idea. At the same time, those of ordinary skill in the art may, following the idea of the embodiments of the invention, make changes to the specific implementations and the scope of application. In summary, the content of this specification should not be construed as limiting the embodiments of the invention.

Claims (14)

1. A method for building an application database index, characterized by comprising:
parsing a page of an application to obtain page information of the application;
converting the page information into structured information;
building an application index table corresponding to the application according to the structured information.
2. The method according to claim 1, characterized in that the page of the application contains a plurality of components;
the parsing of the page of the application to obtain the page information of the application comprises:
reading the page of the application according to the hierarchical structure of the page;
collecting the page information of the page, the page information comprising attribute information of the components in the page.
3. The method according to claim 2, characterized in that:
the components in the page comprise a parent component and at least one sub-component corresponding to the parent component;
the attribute information of the parent component comprises the name of the parent component;
the attribute information of the sub-component comprises the name of the sub-component, the description information of the sub-component, and the structural relationship information between the sub-component and the parent component and between the sub-component and the other sub-components in the same page.
4. The method according to claim 3, characterized in that the converting of the page information into structured information comprises:
taking the components in the application as nodes;
determining the parent-child relationships and the sibling relationships between the nodes according to the attribute information of the components, wherein the nodes corresponding to the sub-components in the same page are siblings of one another;
generating the tree-structure information corresponding to the application from the nodes having parent-child and sibling relationships.
5. The method according to claim 4, characterized in that the building of the application index table corresponding to the application according to the structured information comprises:
extracting the index keyword information of the corresponding node from the name and description information of the component;
building the hierarchical application index table corresponding to the application according to the index keyword information of the nodes and the relationships between the nodes.
6. An application retrieval method, characterized by comprising:
receiving an input index keyword;
looking up the index keyword in a pre-built application index table;
if the index keyword is found, obtaining the application corresponding to the index keyword and displaying the application;
wherein the application index table is an application index table built by the method according to any one of claims 1-5.
7. The method according to claim 6, characterized in that the looking up of the index keyword in the pre-built application index table comprises:
traversing the application index table level by level according to the hierarchical structure of the application index table, and checking whether the application index table contains the index keyword; if it does, the index keyword is found; if it does not, the index keyword is not found.
8. An application index building system, characterized by comprising:
a page information acquisition module, configured to parse a page of an application and obtain page information of the application;
a page information conversion module, configured to convert the page information into structured information;
an application index table building module, configured to build an application index table corresponding to the application according to the structured information.
9. The system according to claim 8, characterized in that the page of the application contains a plurality of components;
the page information acquisition module comprises:
a page reading submodule, configured to read the page of the application according to the hierarchical structure of the page;
a page information collection submodule, configured to collect the page information of the page, the page information comprising attribute information of the components in the page.
10. The system according to claim 9, characterized in that:
the components in the page comprise a parent component and at least one sub-component corresponding to the parent component;
the attribute information of the parent component comprises the name of the parent component;
the attribute information of the sub-component comprises the name of the sub-component, the description information of the sub-component, and the structural relationship information between the sub-component and the parent component and between the sub-component and the other sub-components in the same page.
11. The system according to claim 10, characterized in that the page information conversion module comprises:
a relation determining submodule, configured to take the components in the application as nodes and determine the parent-child relationships and the sibling relationships between the nodes according to the attribute information of the components, wherein the nodes corresponding to the sub-components in the same page are siblings of one another;
a structured information generating submodule, configured to generate the tree-structure information corresponding to the application from the nodes having parent-child and sibling relationships.
12. The system according to claim 11, characterized in that the application index table building module comprises:
a keyword extraction submodule, configured to extract the index keyword information of the corresponding node from the name and description information of the component;
an application index table building submodule, configured to build the hierarchical application index table corresponding to the application according to the index keyword information of the nodes and the relationships between the nodes.
13. An application retrieval system, characterized by comprising:
a keyword receiving module, configured to receive an input index keyword;
a keyword lookup module, configured to look up the index keyword in a pre-built application index table;
an application acquisition module, configured to obtain the application corresponding to the index keyword when the keyword lookup module finds the index keyword, and to display the application;
wherein the application index table is an application index table built by the system according to any one of claims 8-12.
14. The system according to claim 13, characterized in that the keyword lookup module comprises:
an application index table traversal submodule, configured to traverse the application index table level by level according to the hierarchical structure of the application index table, and check whether the application index table contains the index keyword; if it does, the index keyword is found; if it does not, the index keyword is not found.
CN201210449284.3A 2012-11-09 2012-11-09 Application index building method and system, and application retrieval method and system Active CN102968469B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210449284.3A CN102968469B (en) 2012-11-09 2012-11-09 Application index building method and system, and application retrieval method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210449284.3A CN102968469B (en) 2012-11-09 2012-11-09 Application index building method and system, and application retrieval method and system

Publications (2)

Publication Number Publication Date
CN102968469A true CN102968469A (en) 2013-03-13
CN102968469B CN102968469B (en) 2016-05-18

Family

ID=47798607

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210449284.3A Active CN102968469B (en) 2012-11-09 2012-11-09 Application index building method and system, and application retrieval method and system

Country Status (1)

Country Link
CN (1) CN102968469B (en)

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20010049675A1 (en) * 2000-06-05 2001-12-06 Benjamin Mandler File system with access and retrieval of XML documents

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
刘畅: "综合搜索引擎与垂直搜索引擎的比较研究", 《情报科学》, vol. 25, no. 1, 31 January 2007 (2007-01-31) *
史红改等: "《实用网络文献信息资源检索与利用》", 31 March 2009, 清华大学出版社, article "实用网络文献信息资源检索与利用" *
尤波: "垂直搜索中XML索引及页面排序技术研究", 《中国优秀硕士学位论文全文数据库信息科技辑》, 15 June 2009 (2009-06-15) *

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103324757A (en) * 2013-07-09 2013-09-25 百度在线网络技术(北京)有限公司 Method and device for searching application programs in mobile terminal
CN106687949A (en) * 2014-06-24 2017-05-17 谷歌公司 Search results for native applications
CN112417243A (en) * 2014-06-24 2021-02-26 谷歌有限责任公司 Search results for native applications
US10713324B2 (en) 2014-06-24 2020-07-14 Google Llc Search results for native applications
CN105630907B (en) * 2015-12-21 2019-05-07 北京大学 A kind of Android application assemble method based on application content
CN105630907A (en) * 2015-12-21 2016-06-01 北京大学 Method for assembling android application based on content of application
CN105930477A (en) * 2016-04-28 2016-09-07 北京小米移动软件有限公司 Information search method and device
CN105955618A (en) * 2016-04-29 2016-09-21 北京小米移动软件有限公司 Information display method and device
CN105955618B (en) * 2016-04-29 2019-07-05 北京小米移动软件有限公司 Information display method and device
US10372469B2 (en) 2016-04-29 2019-08-06 Beijing Xiaomi Mobile Software Co., Ltd. Method and device for displaying information
CN106934006A (en) * 2017-03-08 2017-07-07 中国银行股份有限公司 Web page recommendation method and device based on multi-fork tree-model
CN106934006B (en) * 2017-03-08 2020-07-10 中国银行股份有限公司 Page recommendation method and device based on multi-branch tree model
CN107861753A (en) * 2017-06-26 2018-03-30 平安普惠企业管理有限公司 APP generations index, search method and system and readable storage medium storing program for executing
CN108595529A (en) * 2018-03-30 2018-09-28 苏州风中智能科技有限公司 A kind of device of retrieval software function
CN110069463A (en) * 2019-03-12 2019-07-30 北京奇艺世纪科技有限公司 User behavior processing method, device electronic equipment and storage medium
CN110069463B (en) * 2019-03-12 2021-07-16 北京奇艺世纪科技有限公司 User behavior processing method, device electronic equipment and storage medium
CN112115228A (en) * 2019-06-19 2020-12-22 Oppo广东移动通信有限公司 Searching method, searching device, terminal and storage medium

Also Published As

Publication number Publication date
CN102968469B (en) 2016-05-18

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C53 Correction of patent of invention or patent application
CB02 Change of applicant information

Address after: 100085 Beijing city Haidian District Qinghe Street No. 68 Huarun colorful city shopping center two floor 13

Applicant after: Xiaomi Technology Co., Ltd.

Address before: 100102 Beijing Wangjing West Road, a volume of stone world building, A, block, floor 12

Applicant before: Beijing Xiaomi Technology Co., Ltd.

C14 Grant of patent or utility model
GR01 Patent grant