CN101997915A - Deep packet detection device, webpage data processing method, and webpage data acquisition method and system - Google Patents

Deep packet detection device, webpage data processing method, and webpage data acquisition method and system

Info

Publication number
CN101997915A
CN101997915A CN2010105320864A CN201010532086A CN101997915A CN 101997915 A CN101997915 A CN 101997915A CN 2010105320864 A CN2010105320864 A CN 2010105320864A CN 201010532086 A CN201010532086 A CN 201010532086A CN 101997915 A CN101997915 A CN 101997915A
Authority
CN
China
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2010105320864A
Other languages
Chinese (zh)
Other versions
CN101997915B (en)
Inventor
蔡逆水
陈强
杨俊�
蒋丹舟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Telecom Corp Ltd
Original Assignee
China Telecom Corp Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Telecom Corp Ltd
Priority to CN201010532086.4A
Publication of CN101997915A
Application granted
Publication of CN101997915B
Legal status: Active
Anticipated expiration

Abstract

The invention discloses a webpage data processing method, a webpage data acquisition method, a deep packet detection device, and a webpage data acquisition system. The webpage data acquisition method comprises the following steps: selectively capturing Hypertext Transfer Protocol (HTTP) messages flowing to a web server according to a web page address information library; parsing the content of the captured HTTP messages; extracting the content of the label fields in the HTTP messages; and selectively acquiring the data in the captured HTTP messages according to the content of the label fields. The invention combines the deep packet detection technique with the webpage data acquisition technique, which improves the efficiency of acquiring and analyzing webpage data and reduces the cost of acquiring and analyzing massive data. Because label fields are used, webpage data can also be acquired more accurately.

Description

Deep packet detection device, webpage data processing method, and webpage data acquisition method and system
Technical field
The present invention relates to the field of Internet technology, and in particular to a deep packet detection device, a webpage data processing method, a webpage data acquisition method, and a webpage data acquisition system.
Background
With the rapid development of Web technologies and Web applications, centralized monitoring of Web applications and the collection and statistical analysis of user data on websites, particularly on platforms such as electronic channels and e-commerce, are becoming increasingly widespread. However, because such platforms have very large user bases, their user data is massive, so in practice the data must be collected selectively.
Existing web pages, however, were not designed with data collection in mind, and they commonly suffer from problems such as disorganized page addresses, disorganized collected data, and low accuracy. It is therefore difficult to collect data efficiently and accurately from existing web pages.
Summary of the invention
The technical problem to be solved by the present invention is to provide a deep packet detection device, a webpage data processing method, a webpage data acquisition method, and a webpage data acquisition system that can collect webpage data efficiently and accurately.
According to one aspect of the present invention, a webpage data processing method is proposed, comprising: determining the data acquisition scope of the HTTP message of each web page according to data acquisition requirements; and adding a label field to the HTTP message of each web page, the content of the label field indicating the data acquisition scope of the HTTP message of the web page.
In one embodiment of the webpage data processing method, the label field is located in a header field of the HTTP message of each web page.
In another embodiment of the webpage data processing method, the data acquisition scope of the HTTP message of a web page comprises: extracting all of the data in the HTTP message, extracting part of the data in the HTTP message, and extracting no data from the HTTP message.
According to another aspect of the present invention, a webpage data acquisition method is also proposed, comprising: selectively capturing HTTP messages flowing to a web server according to a web page address information library; parsing the content of the captured HTTP messages; extracting the content of the label field in the HTTP messages; and selectively collecting the data in the captured HTTP messages according to the content of the label field.
In one embodiment of the webpage data acquisition method, the HTTP message flowing to the web server is formed by the following steps: determining the data acquisition scope of the HTTP message of each web page according to data acquisition requirements; and adding a label field to the HTTP message of each web page to form the HTTP message flowing to the web server, wherein the content of the label field indicates the data acquisition scope of the HTTP message of the web page.
In another embodiment of the webpage data acquisition method, the label field is located in a header field of the HTTP message of each web page.
In yet another embodiment of the webpage data acquisition method, the data acquisition scope of the HTTP message of a web page comprises: extracting all of the data in the HTTP message, extracting part of the data in the HTTP message, and extracting no data from the HTTP message.
According to a further aspect of the present invention, a deep packet detection device is also proposed, comprising: an address screening module, configured to selectively capture HTTP messages flowing to a web server according to a web page address information library; a message parsing module, connected to the address screening module and configured to parse the content of the captured HTTP messages; a label content extraction module, connected to the message parsing module and configured to extract the content of the label field in the HTTP messages, wherein the content of the label field indicates the data acquisition scope of the HTTP message of the web page; and a data acquisition module, connected to the label content extraction module and configured to selectively collect the data in the captured HTTP messages according to the content of the label field.
In one embodiment of the deep packet detection device, the label field is located in a header field of the HTTP message flowing to the web server.
In another embodiment of the deep packet detection device, the data acquisition scope of the HTTP message of a web page comprises: extracting all of the data in the HTTP message, extracting part of the data in the HTTP message, and extracting no data from the HTTP message.
According to yet another aspect of the present invention, a webpage data acquisition system is also proposed, comprising the deep packet detection device of the foregoing embodiments and a webpage data processing unit. The webpage data processing unit comprises: an acquisition scope determination module, configured to determine the data acquisition scope of the HTTP message of each web page according to data acquisition requirements; and a data processing module, connected to the acquisition scope determination module and configured to add a label field to the HTTP message of each web page to form the HTTP message flowing to the web server, wherein the content of the label field indicates the data acquisition scope of the HTTP message of the web page.
The deep packet detection device, webpage data processing method, webpage data acquisition method, and webpage data acquisition system provided by the present invention combine deep packet inspection (DPI) technology with webpage data acquisition technology, which improves the efficiency of collecting webpage data and reduces the cost of collecting and analyzing massive data. In addition, because a label field is used, the data acquisition scope of each web page can be determined more precisely, which improves the accuracy of data acquisition.
Description of the drawings
The accompanying drawings described herein provide a further understanding of the present invention and form part of this application. In the drawings:
Fig. 1 is a schematic flowchart of one embodiment of the webpage data processing method of the present invention.
Fig. 2 is a schematic flowchart of one embodiment of the webpage data acquisition method of the present invention.
Fig. 3 is a schematic flowchart of another embodiment of the webpage data acquisition method of the present invention.
Fig. 4 is a schematic structural diagram of one embodiment of the deep packet detection device of the present invention.
Fig. 5 is a schematic structural diagram of one embodiment of the webpage data acquisition system of the present invention.
Detailed description of embodiments
The present invention is described more fully below with reference to the accompanying drawings, in which exemplary embodiments of the invention are illustrated. The exemplary embodiments and their descriptions are intended to explain the present invention and do not unduly limit it.
The following description of at least one exemplary embodiment is merely illustrative and in no way limits the invention, its application, or its uses.
The present invention combines DPI technology with Web page data acquisition technology. Based on an analysis of the principle of selective DPI data collection, and in order to improve the efficiency of collection and analysis, it proposes a webpage data processing method, a webpage data acquisition method, a deep packet detection device, and a webpage data acquisition system that facilitate DPI collection.
When performing selective DPI collection, a library is first set up to store the addresses of the pages to be collected. Each request flowing to the server is first looked up against this library; if the address of the requested web page matches a page address in the library, the content of the page is extracted.
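The address lookup described above can be pictured with a minimal Python sketch. It assumes the library is an in-memory set of (host, path) pairs and that matching is exact after a simple normalization; the names `ADDRESS_LIBRARY`, `normalize`, and `should_capture`, and the `/shop/orders` entry, are hypothetical and do not come from the patent.

```python
from typing import Tuple
from urllib.parse import urlsplit

# Hypothetical address library: (host, path) pairs of the pages to be collected.
ADDRESS_LIBRARY = {
    ("202.23.24.153", "/news/sports"),
    ("202.23.24.153", "/shop/orders"),
}


def normalize(url: str) -> Tuple[str, str]:
    """Reduce a URL to a (host, path) key, ignoring the query string and any trailing slash."""
    parts = urlsplit(url)
    path = parts.path.rstrip("/") or "/"
    return (parts.hostname or "", path)


def should_capture(url: str) -> bool:
    """Return True when the requested page is listed in the address library."""
    return normalize(url) in ADDRESS_LIBRARY
```

A set gives constant-time membership tests, which matters when every request passing the probe has to be checked; a real deployment might instead need prefix or pattern matching, which this sketch does not attempt.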
Fig. 1 is a schematic flowchart of one embodiment of the webpage data processing method of the present invention.
As shown in Fig. 1, this embodiment may comprise the following steps:
S102: determine, according to data acquisition requirements, the scope of data acquisition for the HTTP message of each web page;
S104: add a label field to the HTTP message of each web page, the content of the label field indicating the scope of data acquisition for the HTTP message of the web page. The label field may be located anywhere in the HTTP message; preferably, it is located in a header field of the HTTP message of each web page.
In addition, the scope of data acquisition for the HTTP message of a web page may comprise extracting all of the data in the HTTP message (that is, all data from the message header to the message trailer), extracting part of the data in the HTTP message (for example, the IP address, user name, page address, access time, login category, and page parameters), and extracting no data from the HTTP message.
This embodiment takes DPI technology into account when processing webpage data. Based on an analysis of the principle of selective DPI data collection, it proposes a webpage data processing method that facilitates DPI collection, which can significantly improve the efficiency of collecting webpage data and the accuracy of data acquisition.
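Steps S102 and S104 can be sketched in Python as follows. This is only an illustration under stated assumptions: the header name `X-type` and the values `"0"` and `"1"` follow the example given later in the description, while the value `"2"` for "extract nothing", the `Scope` enum, and the `add_label_field` helper are hypothetical.

```python
from enum import Enum


class Scope(Enum):
    """Data acquisition scopes; the code for NONE is an assumption."""
    ALL = "0"      # extract the whole message, header to trailer
    PARTIAL = "1"  # extract selected data only
    NONE = "2"     # extract nothing


LABEL_HEADER = "X-type"  # assumed name of the custom header carrying the label


def add_label_field(headers: dict, scope: Scope) -> dict:
    """Return a copy of the HTTP headers with the label field set to the scope code."""
    labelled = dict(headers)  # copy so the caller's headers are not mutated
    labelled[LABEL_HEADER] = scope.value
    return labelled
```

For example, `add_label_field({"Host": "202.23.24.153"}, Scope.PARTIAL)` yields the original headers plus `X-type: 1`.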
In another embodiment of the webpage data processing method of the present invention, the page addresses are first standardized (for example, http://202.23.24.153/news/sports denotes the sports content within the news section). The Web site and its pages are then divided into levels that correspond to different data acquisition scopes, and a label field whose content corresponds to the applicable scope is added to the HTTP message of each web page. According to the RFC protocol specification, the header of an HTTP message may carry custom fields as the application requires. Custom HTTP header field information (that is, the label field) can therefore be embedded when the electronic-channel web pages are implemented, with different custom information embedded for different data acquisition requirements. This classifies the web pages by level and prepares them for subsequent data collection.
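One way to picture the level classification is a longest-prefix lookup from standardized page paths to label values. The sketch below is an assumption-laden illustration: the `LEVELS` table, its paths and values, and the `label_for_path` helper are all hypothetical and are not specified by the patent.

```python
# Hypothetical level table: standardized path prefix -> label field content.
LEVELS = {
    "/news": "2",
    "/news/sports": "1",
    "/account": "0",
}


def label_for_path(path: str, default: str = "2") -> str:
    """Return the label of the longest matching path prefix, or the default."""
    best = ""
    for prefix in LEVELS:
        matches = path == prefix or path.startswith(prefix + "/")
        if matches and len(prefix) > len(best):
            best = prefix
    return LEVELS.get(best, default)
```

Matching on whole path segments (`prefix + "/"`) avoids treating `/newsletter` as part of `/news`, which a plain `startswith` check would do.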
Fig. 2 is a schematic flowchart of one embodiment of the webpage data acquisition method of the present invention.
As shown in Fig. 2, this embodiment may comprise the following steps:
S202: selectively capture HTTP messages flowing to the web server according to the web page address information library, which may store the addresses of the pages to be captured. A page flowing to the web server is captured, and then parsed and extracted, only when its address satisfies the requirements of the library (for example, the page address is stored in the library);
S204: parse the content of the captured HTTP messages;
S206: extract the content of the label field in the HTTP messages;
S208: selectively collect the data in the captured HTTP messages according to the content of the label field.
The HTTP message flowing to the web server may be formed by the following steps: determining, according to data acquisition requirements, the scope of data acquisition for the HTTP message of each web page; and adding a label field to the HTTP message of each web page to form the HTTP message flowing to the web server, wherein the content of the label field indicates the scope of data acquisition for the HTTP message of the web page.
In one example, the scope of data acquisition for the HTTP message of a web page may comprise extracting all of the data in the HTTP message (that is, all data from the message header to the message trailer), extracting part of the data in the HTTP message (for example, the IP address, user name, page address, access time, login category, and page parameters), and extracting no data from the HTTP message.
Optionally, the label field may be located anywhere in the HTTP message of each web page; preferably, it is located in a header field of the HTTP message of each web page.
When collecting data, this embodiment first screens the web pages to be collected against the web page address information library, which greatly reduces interference from massive data. It further parses the HTTP header fields of the captured pages, extracts the content of the custom header label, and applies a different extraction flow depending on that content: for example, extracting the entire content of the HTTP message, extracting part of it, or extracting nothing. This achieves selective data collection, eases the technical and cost pressure of massive data, and improves the efficiency and accuracy of data collection.
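The parsing and label extraction steps (S204 and S206) might look like the following Python sketch. It assumes a complete HTTP/1.x request is available as bytes and that the label is carried in a header named `X-type`; the function names are hypothetical, and a production DPI engine would also have to handle TCP reassembly and malformed input, which are out of scope here.

```python
def parse_http_request(raw: bytes):
    """Split a raw HTTP/1.x request into (method, target, headers, body)."""
    head, _, body = raw.partition(b"\r\n\r\n")
    lines = head.decode("iso-8859-1").split("\r\n")
    method, target, _version = lines[0].split(" ", 2)
    headers = {}
    for line in lines[1:]:
        name, _, value = line.partition(":")
        # Header names are case-insensitive, so store them lower-cased.
        headers[name.strip().lower()] = value.strip()
    return method, target, headers, body


def extract_label(headers: dict, name: str = "x-type"):
    """Return the label field content, or None when the message carries no label."""
    return headers.get(name)
```

Keeping header names lower-cased means the lookup works whether the page emits `X-type`, `x-type`, or `X-Type`.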
In another embodiment of the webpage data acquisition method of the present invention, HTTP messages flowing to the Web server are parsed according to the RFC protocol specification, and specific information is extracted from the relevant positions of the data to be collected according to the content of the parsed label field. Specifically, when processing HTTP, the DPI device parses the content of the corresponding custom header field (that is, the content of the label field) and invokes a different data collection flow according to the definition of that content, thereby extracting the webpage data. The custom content embedded in an HTTP header field consists of two parts, a label and its content. By convention, a custom header field may begin with "X-"; for example, "X-type:0" may indicate that all content of the HTTP message is to be extracted, and "X-type:1" may indicate that only the URL is to be extracted. Depending on the levels required for the collected content, one or more custom header labels may be defined, each assigned different content to represent the extraction of different data.
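The "X-type" example can be turned into a small dispatch routine. The sketch below follows the two values given in the text (`0` for all content, `1` for the URL only); the `collect` function, the shape of its result, and the choice to collect nothing for a missing or unknown label are assumptions made for illustration.

```python
def collect(label, method, target, headers, body):
    """Select what to keep from a parsed HTTP message according to its label."""
    if label == "0":
        # "X-type:0": extract all content of the HTTP message.
        return {"method": method, "url": target, "headers": dict(headers), "body": body}
    if label == "1":
        # "X-type:1": extract only the URL.
        return {"url": target}
    # Assumption: a missing or unrecognized label means nothing is collected.
    return None
```

If several custom labels were defined, as the text allows, the same pattern would extend to a lookup keyed on the header name and value rather than a chain of `if` statements.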
Fig. 3 is a schematic flowchart of another embodiment of the webpage data acquisition method of the present invention.
As shown in Fig. 3, this embodiment may comprise the following steps:
S302: build the DPI acquisition system and mirror the traffic of the target website to be collected;
S304: establish the web page address information library, which stores the addresses of the pages to be captured;
S306: establish a selective parsing content depth information library, which stores the data collection and parsing subroutines corresponding to the different custom labels, for example the full-data subroutine used to extract the entire content of an HTTP message and the partial-data subroutine used to extract part of an HTTP message;
S308: selectively capture the pages flowing to the web server according to the web page address information library;
S310: store the captured data;
S312: parse the content of the HTTP messages of the captured pages, and selectively collect the data in the captured HTTP messages according to the content of the label field in the HTTP messages;
S314: store the parsed data by category.
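Steps S304 to S314 can be combined into a short end-to-end sketch. It assumes messages have already been parsed into dictionaries with `url`, `headers`, and `body` keys, models the "content depth information library" as a dictionary of subroutines keyed by label value, and uses in-memory lists for storage; every name here (`ADDRESSES`, `PARSERS`, `collect_all`, `collect_partial`, `run`) is hypothetical.

```python
from collections import defaultdict


def collect_all(msg):
    """Full-data subroutine: keep the whole message."""
    return dict(msg)


def collect_partial(msg):
    """Partial-data subroutine: keep only the page address."""
    return {"url": msg["url"]}


ADDRESSES = {"/news/sports", "/shop/orders"}         # S304: address library
PARSERS = {"0": collect_all, "1": collect_partial}   # S306: subroutine per label


def run(messages):
    """Capture, store, parse, and classify messages (S308 to S314)."""
    raw_store = []                   # S310: captured data
    classified = defaultdict(list)   # S314: parsed data grouped by label
    for msg in messages:
        if msg["url"] not in ADDRESSES:   # S308: selective capture
            continue
        raw_store.append(msg)
        label = msg["headers"].get("X-type")
        parser = PARSERS.get(label)
        if parser is not None:            # S312: selective collection
            classified[label].append(parser(msg))
    return raw_store, dict(classified)
```

Registering subroutines in a table keeps the capture loop unchanged when a new label is introduced; only a new entry in `PARSERS` is needed.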
Fig. 4 is a schematic structural diagram of one embodiment of the deep packet detection device of the present invention.
As shown in Fig. 4, the deep packet detection device 10 of this embodiment may comprise:
an address screening module 11, configured to selectively capture HTTP messages flowing to the web server according to the web page address information library;
a message parsing module 12, connected to the address screening module and configured to parse the content of the captured HTTP messages;
a label content extraction module 13, connected to the message parsing module and configured to extract the content of the label field in the HTTP messages, wherein the content of the label field indicates the data acquisition scope of the HTTP message of the web page; optionally, the data acquisition scope may comprise extracting all of the data in the HTTP message, extracting part of the data in the HTTP message, and extracting no data from the HTTP message;
a data acquisition module 14, connected to the label content extraction module and configured to selectively collect the data in the captured HTTP messages according to the content of the label field.
Optionally, the label field may be located in a header field of the HTTP message flowing to the web server.
When collecting data, this embodiment first screens the web pages to be collected by web page address, which greatly reduces the amount of massive data that must be processed. It also parses the HTTP header fields of the captured pages, extracts the content of the custom header label, and applies a different extraction flow depending on that content: extracting the entire content of the HTTP message, extracting part of it, or extracting nothing. This achieves selective data collection, eases the technical and cost pressure of massive data, and improves the efficiency and accuracy of data collection.
Fig. 5 is a schematic structural diagram of one embodiment of the webpage data acquisition system of the present invention.
As shown in Fig. 5, the webpage data acquisition system of this embodiment may comprise the deep packet detection device 10 of the previous embodiment and a webpage data processing unit 21, wherein the webpage data processing unit 21 comprises:
an acquisition scope determination module 211, configured to determine the data acquisition scope of the HTTP message of each web page according to data acquisition requirements;
a data processing module 212, connected to the acquisition scope determination module and configured to add a label field to the HTTP message of each web page to form the HTTP message flowing to the web server, wherein the content of the label field indicates the data acquisition scope of the HTTP message of the web page.
Although some specific embodiments of the present invention have been described in detail by way of example, those skilled in the art should understand that the above examples are for illustration only and are not intended to limit the scope of the invention. Those skilled in the art should also understand that the above embodiments may be modified without departing from the scope and spirit of the present invention. The scope of the present invention is defined by the claims.

Claims (11)

1. A webpage data processing method, characterized by comprising:
determining the data acquisition scope of the HTTP message of each web page according to data acquisition requirements;
adding a label field to the HTTP message of each web page, wherein the content of the label field indicates the data acquisition scope of the HTTP message of the web page.
2. The method according to claim 1, characterized in that the label field is located in a header field of the HTTP message of each web page.
3. The method according to claim 1, characterized in that the data acquisition scope of the HTTP message of the web page comprises extracting all of the data in the HTTP message, extracting part of the data in the HTTP message, and extracting no data from the HTTP message.
4. A webpage data acquisition method, characterized by comprising:
selectively capturing HTTP messages flowing to a web server according to a web page address information library;
parsing the content of the captured HTTP messages;
extracting the content of the label field in the HTTP messages;
selectively collecting the data in the captured HTTP messages according to the content of the label field.
5. The method according to claim 4, characterized in that the HTTP message flowing to the web server is formed by the following steps:
determining the data acquisition scope of the HTTP message of each web page according to data acquisition requirements;
adding the label field to the HTTP message of each web page to form the HTTP message flowing to the web server, wherein the content of the label field indicates the data acquisition scope of the HTTP message of the web page.
6. The method according to claim 4 or 5, characterized in that the label field is located in a header field of the HTTP message of each web page.
7. The method according to claim 5, characterized in that the data acquisition scope of the HTTP message of the web page comprises extracting all of the data in the HTTP message, extracting part of the data in the HTTP message, and extracting no data from the HTTP message.
8. A deep packet detection device, characterized by comprising:
an address screening module, configured to selectively capture HTTP messages flowing to a web server according to a web page address information library;
a message parsing module, connected to the address screening module and configured to parse the content of the captured HTTP messages;
a label content extraction module, connected to the message parsing module and configured to extract the content of the label field in the HTTP messages, wherein the content of the label field indicates the data acquisition scope of the HTTP message of the web page;
a data acquisition module, connected to the label content extraction module and configured to selectively collect the data in the captured HTTP messages according to the content of the label field.
9. The device according to claim 8, characterized in that the label field is located in a header field of the HTTP message flowing to the web server.
10. The device according to claim 8, characterized in that the data acquisition scope of the HTTP message of the web page comprises extracting all of the data in the HTTP message, extracting part of the data in the HTTP message, and extracting no data from the HTTP message.
11. A webpage data acquisition system, characterized by comprising the deep packet detection device according to any one of claims 8 to 10 and a webpage data processing unit, wherein the webpage data processing unit comprises:
an acquisition scope determination module, configured to determine the data acquisition scope of the HTTP message of each web page according to data acquisition requirements;
a data processing module, connected to the acquisition scope determination module and configured to add the label field to the HTTP message of each web page to form the HTTP message flowing to the web server, wherein the content of the label field indicates the data acquisition scope of the HTTP message of the web page.
CN201010532086.4A 2010-10-29 2010-10-29 Deep packet detection device, webpage data processing method, and webpage data acquisition method and system Active CN101997915B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201010532086.4A CN101997915B (en) 2010-10-29 2010-10-29 Deep packet detection device, webpage data processing method, and webpage data acquisition method and system

Publications (2)

Publication Number Publication Date
CN101997915A 2011-03-30
CN101997915B 2014-01-08

Family

ID=43787485

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201010532086.4A Active CN101997915B (en) 2010-10-29 2010-10-29 Deep packet detection device, webpage data processing method, and webpage data acquisition method and system

Country Status (1)

Country Link
CN (1) CN101997915B (en)

Also Published As

Publication number Publication date
CN101997915B (en) 2014-01-08

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant