CN100456296C - Method for sequencing multi-medium file search engine - Google Patents

Info

Publication number
CN100456296C
Authority
CN
China
Prior art keywords
rule
information
file
search engine
download link
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CNB2006100905682A
Other languages
Chinese (zh)
Other versions
CN101075238A (en)
Inventor
余祥鑫
文杰
熊应
刘致远
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Shiji Guangsu Information Technology Co Ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tencent Technology Shenzhen Co Ltd
Priority to CNB2006100905682A
Publication of CN101075238A
Application granted
Publication of CN100456296C

Abstract

A sorting method for a multimedia file search engine comprises: setting ordering rules in advance; obtaining download link information of multimedia files from the Internet with a crawler; having a detector parse each ordering rule into atomic rules and then check and score the download link information according to the parsed atomic rules; and having an indexer sort the download links of the multimedia files according to the scores.

Description

A sorting method for a multimedia file search engine
Technical field
The present invention relates to the field of search engine technology, and more particularly to a sorting method for a multimedia file search engine.
Background art
Search engine technology has become very popular in recent years. Web page search, news search, multimedia file search, map search and other services built on it have great practical and commercial value. New search engine techniques keep emerging, and the search applications based on them are developing rapidly.
Multimedia file search generally comprises music file search, video file search and picture file search. A music file search engine, usually also called an Mp3 search engine, uses search technology to retrieve information about Mp3 and other music file formats and to provide their download Uniform Resource Locators (URLs). Likewise, a video file search engine retrieves information about RM, WMV and other video file formats and provides their download URLs, and a picture file search engine retrieves information about Joint Photographic Experts Group (JPEG) and other image file formats and provides their URLs.
As search technology matures and Internet users' demand for multimedia file download services keeps growing, competition in multimedia file search has intensified and the technology is developing ever faster. Besides improving search results in quantity (for example, by increasing the number of multimedia file links and reducing dead links), search quality must also be improved to give users the best possible experience.
In file search, the search results must be sorted, and the sorting of results is one of the most critical parts of the search experience. For multimedia file search, besides the URL of the multimedia file found by the search engine, some additional multimedia file information usually has to be provided as well. For an Mp3 search engine, for example, besides the URL link of the Mp3 file, information such as the song title, singer name and album name must be provided. Similarly, a video file search engine must provide information such as the title of the video file and the names of the performers. Keeping this information complete and sorting it sensibly is the foundation of a good multimedia file search engine.
Fig. 1 is a schematic diagram of sorting in a prior-art multimedia file search engine. A crawler (Crawler) first obtains download links of multimedia files from the Internet. A detector (Detector) then checks these download links to find the live links among them, scores and sorts the live links, and sends them to the indexer. The indexer (Index) builds the search index, and finally the user performs operations such as downloading directly from the Internet according to the index. The sorting problem can largely be reduced to the problem of scoring the search results, which mainly considers two aspects:
1. scoring the link itself and the anchor text (anchor) that the crawler captures on the web page;
2. scoring the Tag information of files such as Mp3 and WMA, where Tag information is the song title, singer, album and similar information that a multimedia file usually carries.
In general, combining these two aspects solves the basic sorting problem. However, as search technology develops, search deception (spam) techniques also keep emerging. Many websites spam search engines through Tag information, so scores based on Tags tend to be inaccurate: they can give a spam website a very high score and even help it advertise, which seriously degrades the user experience.
In addition, the download links and anchor texts captured by the crawler are often repeated, so anchor text frequently cannot distinguish two different multimedia files. For example, many anchor texts are simply "click", "audition" or "try", and such text cannot identify the corresponding multimedia file.
Moreover, because web page and Tag spam techniques are constantly changing and become more covert over time, fixed rules can hardly prevent spam or distinguish duplicate records.
Summary of the invention
In view of this, the main object of the present invention is to provide a sorting method for a multimedia file search engine that dynamically reduces or even overcomes spam in the search process.
To achieve this object, the technical solution of the present invention is realized as follows:
A sorting method for a multimedia file search engine, in which at least one atomic rule is set in advance and an ordering rule expressed in terms of atomic rules is further set, the method further comprising the following steps:
A. a crawler obtains download link information of multimedia files from the Internet, the download link information comprising at least the download links of the multimedia files;
B. a detector parses the ordering rule into atomic rules, and checks and scores the download link information according to the parsed atomic rules;
C. an indexer sorts the download links of the multimedia files according to the scores.
The atomic rules comprise any one, or any combination, of the following logic rules:
information percentage greater than a preset value, information percentage less than a preset value, information percentage not equal to a preset value, information percentage equal to a preset value, discard information, and do not discard information, where the information percentage is the proportion of a given piece of information in the total information.
Setting the ordering rule expressed in terms of atomic rules comprises: setting the ordering rule as a regular expression of atomic rules;
in step B, the detector parses the ordering rule into atomic rules as follows: the detector analyzes the regular expression so as to parse the ordering rule into atomic rules.
The ordering rule is stored in a text file,
and in step B the detector loads the ordering rule from the text file at initialization in order to parse it.
The text file is an Extensible Markup Language (XML) file.
The download link information further comprises Tag information, and the ordering rule comprises:
if the Tag information contains a link, no content is taken from the Tag information and the score of the download link of the multimedia file is reduced.
The download link information further comprises anchor text, and the ordering rule comprises:
if identical anchor texts on the same website exceed a predetermined proportion, no content is taken from the anchor texts of that website.
The download link information further comprises anchor text and/or Tag information.
The multimedia file comprises a music file, a video file or an image file.
The music file comprises an mp3 file, a WMA file or an RM file.
As can be seen from the above technical solution, in the present invention at least one atomic rule is set in advance and an ordering rule expressed in terms of atomic rules is further set; a crawler then obtains download link information of multimedia files from the Internet, the download link information comprising at least the download links of the multimedia files; a detector then parses the ordering rule into atomic rules, and checks and scores the download link information according to the parsed atomic rules; finally, an indexer sorts the download links of the multimedia files according to the scores.
Thus the ordering rules of the present invention are expressed by simple atomic rules, so they can be updated quickly and conveniently: ordering rules can be loaded dynamically without rewriting code. Code and rules are therefore separated, response rules can be formulated quickly against various kinds of spam, and spam in the search process is dynamically reduced or even overcome.
In addition, in the present invention, if identical anchor texts on the same website exceed a predetermined proportion, no content is taken from the anchor texts of that website. This overcomes the prior-art defect that repeated anchor texts make multimedia files indistinguishable, and it also overcomes anchor text spam.
Furthermore, in the present invention, if the Tag information contains a link, no content is taken from the Tag information and the score of the download link of the multimedia file is reduced. The present invention can therefore overcome spam related to Tag information.
Description of the drawings
Fig. 1 is a schematic diagram of sorting in a prior-art multimedia file search engine.
Fig. 2 shows an exemplary sorting method for a multimedia file search engine according to the present invention.
Detailed description
To make the purpose, technical solution and advantages of the present invention clearer, the present invention is described in further detail below with reference to the drawings and specific embodiments.
The main idea of the present invention is as follows: at least one atomic rule is set in advance and an ordering rule expressed in terms of atomic rules is further set; a crawler then obtains download link information of multimedia files from the Internet, the download link information comprising at least the download links of the multimedia files; a detector then parses the ordering rule into atomic rules, and checks and scores the download link information according to the parsed atomic rules; finally, an indexer sorts the download links of the multimedia files according to the scores.
Fig. 2 shows an exemplary sorting method for a multimedia file search engine according to the present invention. As shown in Fig. 2, the method comprises:
Step 201: set at least one atomic rule, and further set an ordering rule expressed in terms of atomic rules.
However complex a logic rule is, it can be formed by combining simple logic. A few initial logic rules can be specified in advance; these basic logic rules are the atomic rules. For example, "greater than", "contains", "not equal to" and "less than" can be written as atomic rules, so that nearly all scoring and anti-spam operations can be composed from them. The atomic rules may include, for example: information percentage greater than a preset value, information percentage less than a preset value, information percentage not equal to a preset value, information percentage equal to a preset value, discard information, do not discard information, and so on.
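The idea that complex rules are built from a few basic comparisons can be sketched as follows. This is only an illustration under stated assumptions: the names `COMPARATORS`, `percentage_rule` and `all_of` are hypothetical and do not come from the patent, which lists the kinds of comparison but not how they are coded.

```python
import operator
from typing import Callable, Dict

Predicate = Callable[[float], bool]

# Hypothetical operator names; the text only lists the kinds of comparison.
COMPARATORS: Dict[str, Callable[[float, float], bool]] = {
    "greater": operator.gt,
    "less": operator.lt,
    "equal": operator.eq,
    "not_equal": operator.ne,
}


def percentage_rule(comparison: str, preset: float) -> Predicate:
    """Atomic rule: compare an information percentage with a preset value."""
    compare = COMPARATORS[comparison]
    return lambda percentage: compare(percentage, preset)


def all_of(*rules: Predicate) -> Predicate:
    """Composite rule that holds only when every atomic rule holds."""
    return lambda percentage: all(rule(percentage) for rule in rules)


above_30 = percentage_rule("greater", 30)
between_30_and_90 = all_of(above_30, percentage_rule("less", 90))
print(above_30(45), between_30_and_90(95))  # prints: True False
```

Because each atomic rule is just a small predicate, new composite rules can be assembled without touching the code that evaluates them.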
After the atomic rules are set, an ordering rule expressed in terms of atomic rules can be set.
When the download link information further comprises Tag information, the ordering rule may be: if the Tag information contains a link, no content is taken from the Tag information, and the score of the download link of the multimedia file is reduced. This rule avoids spam in the Tag information, and overcomes the problem that multimedia files cannot be distinguished because anchor texts are repeated.
When the download link information further comprises anchor text, the ordering rule may be: if identical anchor texts on the same website exceed a predetermined proportion, no content is taken from the anchor texts of that website. This rule overcomes the problem that multimedia files cannot be distinguished because their anchor texts are identical.
Preferably, the ordering rule is set as a regular expression of atomic rules. Preferably, the ability of XML to store complex structured data is also used: the ordering rule is written as a regular expression of the above atomic rules, and a regular expression parser then makes it possible to keep the rules in text form.
XML is a simple, platform-independent and widely adopted standard; it is a meta-language for defining other languages. In short, XML provides a way to describe structured data. It not only accomplishes tasks that HTML cannot, but also gives the Internet a tool for defining the "technical vocabulary" of every industry. XML can be used in many kinds of applications; in essence, XML is a way of representing data. Sometimes the data are prepared for a database, and sometimes they are meant to be read by people. The technologies related to these two uses, such as data validation and XML transformation, have grown up together with XML itself. XML includes the ability to validate or confirm document structure and (in a sense) document content. Data encapsulated in XML can be used in several ways. One common way is Extensible Stylesheet Language Transformations (XSLT): a developer can use XSLT to define operations on an XML document and generate a specific result. This ability to transform information dynamically allows multiple outputs to be produced from a single source document, whether the output goes to different databases or to different browsers.
As can be seen, XML makes it easy to write an ordering rule as a regular expression of the above atomic rules.
Step 202: the crawler obtains download link information of multimedia files from the Internet, the download link information comprising at least the download links of the multimedia files.
Here, when a user performs a multimedia file search, the crawler first collects download link information of multimedia files from the Internet and stores it in a certain hash (Hash) order. Besides the download link (URL) of the multimedia file, the download link information may further comprise anchor text and/or Tag information.
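One possible shape for a stored download link record is sketched below. It assumes that "hash order" means the records are keyed by a hash of the URL and kept sorted by that key; the patent does not specify the hash function or the storage layout, so `DownloadLinkInfo`, `url_key`, `store_in_hash_order` and the use of SHA-256 are all illustrative choices.

```python
import hashlib
from dataclasses import dataclass, field
from typing import Dict, Iterable, Optional


@dataclass
class DownloadLinkInfo:
    url: str                                            # always present
    anchor_text: Optional[str] = None                   # optional, may be spam
    tag: Dict[str, str] = field(default_factory=dict)   # optional Tag fields


def url_key(url: str) -> str:
    """Hash key for a URL (SHA-256 is an illustrative choice, not from the text)."""
    return hashlib.sha256(url.encode("utf-8")).hexdigest()


def store_in_hash_order(records: Iterable[DownloadLinkInfo]) -> Dict[str, DownloadLinkInfo]:
    """Key each record by its URL hash and keep the keys in sorted order."""
    table = {url_key(record.url): record for record in records}
    return dict(sorted(table.items()))


records = [
    DownloadLinkInfo("http://a.example/1.mp3", anchor_text="click"),
    DownloadLinkInfo("http://b.example/2.mp3"),
]
stored = store_in_hash_order(records)
```

A side effect of keying by URL hash in this sketch is that two records with the same URL collapse into one entry; whether the original system behaves this way is not stated.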
Step 203: the detector parses the ordering rule into atomic rules, and checks and scores the download link information according to the parsed atomic rules.
The detector first parses the ordering rule into atomic rules. When the ordering rule is set as a regular expression of atomic rules, the detector first analyzes the regular expression to parse the ordering rule into atomic rules, and then checks and scores the download link information according to the parsed atomic rules.
Here the detector checks the stored download link information, finds the live links among it, and scores only these live links according to the parsed atomic rules. For example, for the URLs from one website, if the proportion of identical anchor texts exceeds a certain threshold, no information is taken from the anchor texts of that website and the score is reduced.
Step 204: the indexer sorts the download links of the multimedia files according to the scores.
Usually, the download links of multimedia files with higher scores are placed at the front of the index so that users can find them sooner. A download link can also be placed at the front of the index together with its associated additional information.
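Steps 202 to 204 can be sketched end to end as below. This is a minimal illustration, not the patented implementation: the `Link` class, the `alive` flag (standing in for a liveness check done elsewhere) and the two scoring rules are hypothetical, and each rule is assumed to return a score adjustment that the detector sums.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List


@dataclass
class Link:
    url: str
    alive: bool          # result of a liveness check, assumed done elsewhere
    score: float = 0.0


ScoreRule = Callable[[Link], float]  # returns a score adjustment


def detect_and_score(links: Iterable[Link], rules: List[ScoreRule]) -> List[Link]:
    """Detector: keep live links only, then sum the rule adjustments."""
    live = [link for link in links if link.alive]
    for link in live:
        link.score = sum(rule(link) for rule in rules)
    return live


def build_index(scored: List[Link]) -> List[Link]:
    """Indexer: higher-scoring download links come first."""
    return sorted(scored, key=lambda link: link.score, reverse=True)


# Hypothetical rules: reward .mp3 URLs, penalise very long URLs.
rules: List[ScoreRule] = [
    lambda link: 1.0 if link.url.endswith(".mp3") else 0.0,
    lambda link: -0.5 if len(link.url) > 40 else 0.0,
]
crawled = [
    Link("http://a.example/song.mp3", alive=True),
    Link("http://b.example/dead.mp3", alive=False),
    Link("http://c.example/page", alive=True),
]
index = build_index(detect_and_score(crawled, rules))
print([link.url for link in index])
# prints: ['http://a.example/song.mp3', 'http://c.example/page']
```

The dead link never receives a score, which mirrors the statement that only live links are scored.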
The present invention is described in more detail below with a concrete example, taking an Mp3 search engine as the example.
Suppose that, for a certain Mp3 search engine, the crawler has crawled the download link information of many Mp3 files from the network. This download link information comprises:
1. the URL string, which may contain the song name;
2. the anchor text, which may contain the song name but may also be wrong or even spam;
3. the Tag information content, some of which is spam.
Two pieces of information must then be extracted from these three: song information (such as the singer name and song title) and the sorting score.
First, some atomic rules are defined. Table 1 is an exemplary table of atomic rules of the present invention. These atomic rules comprise: information percentage greater than a certain value, information percentage less than a certain value, information percentage equal to a certain value, discard information, and do not discard information. The comparisons of the information percentage (greater than, less than or equal to a certain value) all belong to the information percentage operation (Partof). The information percentage operation has four parameters: parameter 1 is Tag, URL or anchor text; parameter 2 is all information or site information; parameter 3 is the percentage value; parameter 4 is greater than, less than or equal to. For example, Partof(Tag, website, 30, greater than) checks whether, among all Tags belonging to this website, the proportion matching this Tag information is greater than 30%. Discard (Drop) indicates whether a given piece of information is discarded. It can have two parameters: parameter 1 is Tag, URL or anchor text; parameter 2 is true/false. For example, Drop(Tag, true) discards the Tag information of this link.
Atomic operation | Explanation | Input | Output | Example
Information percentage (Partof) | Judges the proportion of a given piece of information in the total information | Parameter 1: Tag / URL / anchor text; parameter 2: all / website; parameter 3: percentage value; parameter 4: greater than / less than / equal to | true / false | Partof(Tag, website, 30, greater than) checks whether, among all Tags belonging to this website, the proportion matching this Tag information is greater than 30%
Discard (Drop) | Whether to discard a given piece of information | Parameter 1: Tag / URL / anchor; parameter 2: true / false | none | Drop(Tag, true) discards the Tag information of this link
Table 1
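The two atomic operations in Table 1 might be coded roughly as follows. This is a sketch under assumptions: records are plain dictionaries with hypothetical keys "site", "URL", "Anchor" and "Tag", the percentage is taken as the share of records in scope whose field equals this record's value, and the comparison names are shortened to "greater", "less" and "equal".

```python
from typing import Dict, List, Optional

Record = Dict[str, Optional[str]]  # keys used here: "site", "URL", "Anchor", "Tag"


def part_of(record: Record, records: List[Record], field: str,
            scope: str, percent: float, comparison: str) -> bool:
    """Partof: share of records in scope whose `field` equals this record's value."""
    pool = [r for r in records
            if scope == "all" or r["site"] == record["site"]]
    if not pool:
        return False
    matches = sum(1 for r in pool if r.get(field) == record.get(field))
    ratio = 100.0 * matches / len(pool)
    if comparison == "greater":
        return ratio > percent
    if comparison == "less":
        return ratio < percent
    if comparison == "equal":
        return ratio == percent
    raise ValueError(f"unknown comparison: {comparison}")


def drop(record: Record, field: str, flag: bool) -> None:
    """Drop: clear the named information when `flag` is true."""
    if flag:
        record[field] = None


records: List[Record] = [
    {"site": "a.example", "URL": "http://a.example/1.mp3", "Anchor": "click"},
    {"site": "a.example", "URL": "http://a.example/2.mp3", "Anchor": "click"},
    {"site": "a.example", "URL": "http://a.example/3.mp3", "Anchor": "click"},
    {"site": "a.example", "URL": "http://a.example/4.mp3", "Anchor": "My Song"},
]
# Evaluate every flag before dropping, so earlier drops do not change later ratios.
flags = [part_of(r, records, "Anchor", "website", 30, "greater") for r in records]
for record, flag in zip(records, flags):
    drop(record, "Anchor", flag)
```

In this sample, "click" accounts for 75% of the anchor texts on the site and is dropped, while "My Song" (25%) is kept.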
In addition, statistics show that the following ordering rules have a large effect on scoring:
Ordering rule 1: for the URLs from one website, if the proportion of identical anchor texts exceeds a certain threshold (for example 30%), no information is taken from the anchor texts of that website.
This rule rests on one assumption: a single website is unlikely to have many links that all point to the same song. For example, on one website nearly every anchor text is "click", and on some other websites many anchor texts are "audition". If there happens to be an Mp3 titled "singing audition well", keeping such anchor text easily causes misjudgment. Some websites even fill their anchor texts entirely with well-known song titles in order to raise their scores. Applying this rule prevents repeated anchor texts from making different Mp3 songs indistinguishable, and also overcomes spam related to anchor text.
Ordering rule 2: if the Tag information of a song contains a link, no information is taken from the Tag, and the score of the URL of this song is reduced.
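Ordering rule 2 can be illustrated with a short check. The pattern that decides what counts as a "link", the function name `apply_tag_rule` and the penalty of 10 points are all assumptions for the sake of the example; the patent only says that the Tag is ignored and the score reduced.

```python
import re
from typing import Dict, Tuple

# Illustrative pattern: treats "http(s)://..." or "www...." as a link.
LINK_PATTERN = re.compile(r"(?:https?://|www\.)\S+", re.IGNORECASE)


def apply_tag_rule(tag: Dict[str, str], score: float,
                   penalty: float = 10.0) -> Tuple[Dict[str, str], float]:
    """Ordering rule 2: ignore a Tag that contains a link and lower the score."""
    if any(LINK_PATTERN.search(value) for value in tag.values()):
        return {}, score - penalty
    return tag, score


clean = {"title": "My Song", "artist": "Some Singer"}
spam = {"title": "My Song", "artist": "Visit www.spam.example now"}
print(apply_tag_rule(clean, 50.0)[1], apply_tag_rule(spam, 50.0)[1])
# prints: 50.0 40.0
```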
As an example, ordering rule 1 is written as an expression of atomic operations. With an XML text file, for instance:
Ordering rule 1: if more than 30% of the anchor texts are identical, discard the anchor text information.
Operation: Drop(Anchor, $1), where $1 = Partof(Anchor, website, 30, greater than)
The XML is as follows:
<Rule>
  <R Name="Partof" Return="$1">
    <Para>Anchor</Para>
    <Para>website</Para>
    <Para>30</Para>
    <Para>greater than</Para>
  </R>
  <R Name="Drop" Return="">
    <Para>Anchor</Para>
    <Para>$1</Para>
  </R>
</Rule>
In this way, ordering rule 1 is expressed in XML by atomic rules. Ordering rule 2 can obviously be expressed by atomic rules in a similar way. When the detector starts, it can load the rules from the XML file into memory; every Mp3 link is then checked against the rules, and the corresponding action is carried out.
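Loading such a rule at detector start-up could look like the sketch below, which uses Python's standard `xml.etree.ElementTree` parser. It assumes the rule file is well-formed XML with straight quotes, that each `R` element names one atomic operation, and that a `Para` value such as `$1` refers to the result of an earlier step. The functions `parse_rule` and `run_rule` and the stub operations are hypothetical.

```python
import xml.etree.ElementTree as ET
from typing import Any, Callable, Dict, List, Tuple

RULE_XML = """
<Rule>
  <R Name="Partof" Return="$1">
    <Para>Anchor</Para>
    <Para>website</Para>
    <Para>30</Para>
    <Para>greater than</Para>
  </R>
  <R Name="Drop" Return="">
    <Para>Anchor</Para>
    <Para>$1</Para>
  </R>
</Rule>
"""

Step = Tuple[str, str, List[str]]  # (operation name, return variable, parameters)


def parse_rule(xml_text: str) -> List[Step]:
    """Turn one <Rule> element into an ordered list of atomic operation calls."""
    root = ET.fromstring(xml_text)
    return [
        (r.get("Name", ""), r.get("Return", ""),
         [p.text or "" for p in r.findall("Para")])
        for r in root.findall("R")
    ]


def run_rule(steps: List[Step], operations: Dict[str, Callable[..., Any]]) -> Dict[str, Any]:
    """Run the steps in order, substituting variables such as "$1" by earlier results."""
    variables: Dict[str, Any] = {}
    for name, return_var, params in steps:
        args = [variables.get(p, p) for p in params]
        result = operations[name](*args)
        if return_var:
            variables[return_var] = result
    return variables


# Stub operations for demonstration only; a real detector would inspect link data.
calls: List[Tuple[str, Any]] = []
operations = {
    "Partof": lambda field, scope, percent, comparison: True,
    "Drop": lambda field, flag: calls.append((field, flag)),
}
variables = run_rule(parse_rule(RULE_XML), operations)
```

Since the rule text is only data, changing a threshold or adding a step means editing the XML rather than the detector code, which is the separation of code and rules that the description emphasizes.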
If an ordering rule changes, it is only necessary to modify the XML and restart the process. Rules can thus be modified very quickly, without modifying the search-related code. Of course, because atomic rules are flexible, they can also be combined into other rules; the program must therefore implement a certain number of atomic rules at the outset before dynamic loading through a configuration file can take effect.
According to tests, when an ordering rule changes, the update cycle in actual operation with multiple servers can be kept within 24 hours, so dynamic scoring results and a good experience can be provided and spam can be countered quickly.
In the above process, the present invention has been described in detail with an Mp3 search engine as the example. Obviously, this is only exemplary and is not intended to limit the protection scope of the present invention. In fact, the present invention is applicable to various multimedia file search engines, such as music file search engines, video file search engines and image file search engines. Preferably, when music files are searched, the formats of these music files include the mp3 file format, the wma file format, the RM file format and so on.
The above is only a preferred embodiment of the present invention and is not intended to limit the protection scope of the present invention. Any modification, equivalent replacement or improvement made within the spirit and principles of the present invention shall be included within the protection scope of the present invention.

Claims (10)

1. A sorting method for a multimedia file search engine, characterized in that at least one atomic rule is set in advance and an ordering rule expressed in terms of atomic rules is further set, the method further comprising the following steps:
A. a crawler obtains download link information of multimedia files from the Internet, the download link information comprising at least the download links of the multimedia files;
B. a detector parses the ordering rule into atomic rules, and checks and scores the download link information according to the parsed atomic rules;
C. an indexer sorts the download links of the multimedia files according to the scores.
2. The sorting method for a multimedia file search engine according to claim 1, characterized in that the atomic rules comprise any one, or any combination, of the following logic rules:
information percentage greater than a preset value, information percentage less than a preset value, information percentage not equal to a preset value, information percentage equal to a preset value, discard information, and do not discard information, where the information percentage is the proportion of the download link information in all site information.
3. The sorting method for a multimedia file search engine according to claim 1, characterized in that setting the ordering rule expressed in terms of atomic rules comprises: setting the ordering rule as a regular expression of atomic rules;
in step B, the detector parses the ordering rule into atomic rules as follows: the detector analyzes the regular expression so as to parse the ordering rule into atomic rules.
4. The sorting method for a multimedia file search engine according to claim 1, characterized in that the ordering rule is stored in a text file,
and in step B the detector loads the ordering rule from the text file at initialization in order to parse it.
5. The sorting method for a multimedia file search engine according to claim 4, characterized in that the text file is an Extensible Markup Language (XML) file.
6. The sorting method for a multimedia file search engine according to claim 1, characterized in that the download link information further comprises Tag information, and the ordering rule comprises:
if the Tag information contains a link, no content is taken from the Tag information and the score of the download link of the multimedia file is reduced.
7. The sorting method for a multimedia file search engine according to claim 1, characterized in that the download link information further comprises anchor text, and the ordering rule comprises:
if identical anchor texts on the same website exceed a predetermined proportion, no content is taken from the anchor texts of that website.
8. The sorting method for a multimedia file search engine according to claim 1, characterized in that the download link information further comprises anchor text and/or Tag information.
9. The sorting method for a multimedia file search engine according to claim 1, characterized in that the multimedia file comprises a music file, a video file or an image file.
10. The sorting method for a multimedia file search engine according to claim 9, characterized in that the music file comprises an mp3 file, a wma file or an RM file.
CNB2006100905682A 2006-06-28 2006-06-28 Method for sequencing multi-medium file search engine Active CN100456296C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNB2006100905682A CN100456296C (en) 2006-06-28 2006-06-28 Method for sequencing multi-medium file search engine

Publications (2)

Publication Number Publication Date
CN101075238A CN101075238A (en) 2007-11-21
CN100456296C true CN100456296C (en) 2009-01-28

Family

ID=38976291

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB2006100905682A Active CN100456296C (en) 2006-06-28 2006-06-28 Method for sequencing multi-medium file search engine

Country Status (1)

Country Link
CN (1) CN100456296C (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8131708B2 (en) * 2008-06-30 2012-03-06 Vobile, Inc. Methods and systems for monitoring and tracking videos on the internet
KR101383573B1 (en) * 2008-08-01 2014-04-09 삼성전자주식회사 Electronic apparatus and web-information providing method thereof
CN102147800B (en) * 2010-02-10 2015-06-17 康佳集团股份有限公司 Internet karaoke song requesting method and system
CN104572651B (en) * 2013-10-11 2017-09-29 华为技术有限公司 Picture sort method and device
CN110033035A (en) * 2019-04-04 2019-07-19 武汉精立电子技术有限公司 A kind of AOI defect classification method and device based on intensified learning
CN115278365B (en) * 2022-09-26 2023-01-03 成都华栖云科技有限公司 Website video acquisition method and system

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6285999B1 (en) * 1997-01-10 2001-09-04 The Board Of Trustees Of The Leland Stanford Junior University Method for node ranking in a linked database
US20020152267A1 (en) * 2000-12-22 2002-10-17 Lennon Alison J. Method for facilitating access to multimedia content
US20030208482A1 (en) * 2001-01-10 2003-11-06 Kim Brian S. Systems and methods of retrieving relevant information
CN1710560A (en) * 2005-06-22 2005-12-21 浙江大学 Individual searching engine method based on linkage analysis
CN1755678A (en) * 2004-09-30 2006-04-05 微软公司 System and method for incorporating anchor text into ranking of search results

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
ASS Succession or assignment of patent right

Owner name: SHENZHEN SHIJI LIGHT SPEED INFORMATION TECHNOLOGY

Free format text: FORMER OWNER: TENGXUN SCI-TECH (SHENZHEN) CO., LTD.

Effective date: 20131021

C41 Transfer of patent application or patent right or utility model
COR Change of bibliographic data

Free format text: CORRECT: ADDRESS; FROM: 518044 SHENZHEN, GUANGDONG PROVINCE TO: 518057 SHENZHEN, GUANGDONG PROVINCE

TR01 Transfer of patent right

Effective date of registration: 20131021

Address after: 518057 Tencent Building, 16, Nanshan District hi tech park, Guangdong, Shenzhen

Patentee after: Shenzhen Shiji Guangsu Information Technology Co., Ltd.

Address before: Shenzhen Futian District City, Guangdong province 518044 Zhenxing Road, SEG Science Park 2 East Room 403

Patentee before: Tencent Technology (Shenzhen) Co., Ltd.