CN104933041A - File extraction and reduction method favorable for translation work - Google Patents

File extraction and reduction method favorable for translation work Download PDF

Info

Publication number
CN104933041A
CN104933041A CN201510357672.2A CN201510357672A CN104933041A CN 104933041 A CN104933041 A CN 104933041A CN 201510357672 A CN201510357672 A CN 201510357672A CN 104933041 A CN104933041 A CN 104933041A
Authority
CN
China
Prior art keywords
translation
sentence
interpreter
document
translated
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201510357672.2A
Other languages
Chinese (zh)
Other versions
CN104933041B (en
Inventor
江潮
罗伟峰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Language network (Wuhan) Information Technology Co., Ltd.
Original Assignee
WUHAN TRANSN INFORMATION TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by WUHAN TRANSN INFORMATION TECHNOLOGY Co Ltd filed Critical WUHAN TRANSN INFORMATION TECHNOLOGY Co Ltd
Priority to CN201510357672.2A priority Critical patent/CN104933041B/en
Publication of CN104933041A publication Critical patent/CN104933041A/en
Priority to PCT/CN2015/098668 priority patent/WO2016206336A1/en
Application granted granted Critical
Publication of CN104933041B publication Critical patent/CN104933041B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/40Processing or translation of natural language

Abstract

The invention relates to an artificial intelligence file processing method favorable for translation work. The method comprises the following steps: utilizing an Aspose assembly to support a file processing operation, and disassembling a file object to be translated into a set of data to be translated, wherein the set of the data to be translated takes a simple sentence as a minimum unit; establishing a standard interpreter processing document, copying each sentence in the set of data to be translated to the interpreter processing document one by one; filling translations in the interpreter processing document by an interpreter one by one; traversing the set of data to be translated and the interpreter processing document, and writing the translations into the set of data to be translated; and reducing the set of data to be translated to an original manuscript format document. Various manuscripts of different formats can be converted into the standard interpreter processing document. Sentences which repeatedly appear for multiple times do not need to be repeatedly translated for multiple times, the translation work of the interpreter is simplified, translation efficiency is improved, the execution efficiencies of extraction logics and reduction logics are high, and a reduced translation manuscript reserves an original manuscript format.

Description

A kind of file being beneficial to translation extracts and method of reducing
Technical field
The present invention relates to and be a kind ofly beneficial to the artificial intelligence of translation, the method for document process.
Background technology
Rank among second-biggest-in-the-world economy along with China, the steady enforcement of strategies such as " one band one tunnel ", contacting of Chinese each field and the world is more tight.Language Service supportive market required for communication exchange between multinational in international process seems more voluminous, and this brings new opportunities and challenges to translation industry.
The practitioner of translation industry every day will in the face of the contribution of the needs translation of a large amount of various forms, of a great variety due to contribution, corresponding translator just needs to be grasped all kinds of documentor as Word, Excel, the use of PPT, PDF and the use of all kinds of document assisted translation tools.This for sole duty translator be very large challenge and threshold, and even clearly this kind of problem hindered to whole industry development China globalization process.
Therefore, need to propose a kind of by etc. multiple main flow document format be converted into the document of unified standard pattern and also can will transform the standard document that obtains and be reduced to the method for original copy form conversely.To simplify translation, to improve translation efficiency.
Summary of the invention
Technical matters to be solved by this invention simplifies translation, raising translation efficiency, proposes a kind of file being beneficial to translation and extract and method of reducing.
For solving the problems of the technologies described above, the file being beneficial to translation of the present invention's proposition extracts and method of reducing, comprises the following steps:
1) utilize Aspose dynamic link library to the operation support of document process, disassembling into document object to be translated with simple sentence is the set data to be translated of least unit;
2) set up an interpreter and process document, described interpreter processes document and is provided with " original text ", " translation " and id tri-fields, the corresponding sentence original text of described " original text " field, the corresponding sentence translation of " translation " field;
3) each sentence in the described set data to be translated taking simple sentence as least unit is copied to " original text " field that described interpreter processes document in order one by one, then the content of this sentence in the set data to be translated occupy-place symbol Guid that has uniqueness is substituted, and adjacent occupy-place symbol Guid has different character format; The content of described id field has between described different Guid the relation mapped one by one;
4) described interpreter is processed document and be issued to interpreter, described interpreter processes the original text translating " original text " field in document one by one described interpreter, is filled up to corresponding " translation " field, until processed;
5) travel through described set data to be translated and interpreter processes document, the different ids corresponding according to different Guid, find the translation that this id is corresponding, cover the position of writing this corresponding Guid in described set data to be translated.
6) call Aspose dynamic link library, described set data to be translated is reduced to the document of original copy form.
Described disassemble into document object to be translated with sentence is the set data to be translated of least unit, comprises the following steps:
1-1 calls Aspose assembly;
1-2 travels through described document object, obtains whole paragraph object, and described paragraph object comprises the whole Word message of document object, and does not comprise symbol, image or other the non-legible information without the need to translating;
1-3 travels through the child node object of each paragraph object, thus obtains several character set object Run.There is provided paragraph object, child node object in Aspose assembly, and facilitate the Run object of character manipulation, described Run is to liking the character fragments set that in document, continuous print character format is consistent.
1-4 travels through each Run object, whole Run object is split into the Run object only including a complete sentence one by one, or for only including the Run object of a sentence fragment;
1-5 travels through each Run object, follow-up only to be included in the Run object of a complete sentence by the Run object merging only including sentence fragment to it.
After completing, obtaining take sentence as least unit, only to include the set of the Run object of a complete sentence one by one.
Describedly will only include the Run object merging of a sentence fragment to follow-up Run object, comprise the following steps:
The character content that 1-4-1 will only be the Run object of a sentence fragment is taken out, and leaves temporary storage cell in, then in paragraph object, deletes this Run object;
1-4-2 checks next Run object, if the character content of this Run object is only a sentence fragment, then takes out the character content of this Run object, add temporary storage cell to, then in paragraph object, delete this Run object, continue to check next but one Run object; Otherwise, take out temporary storage cell and deposit character content, before adding the character content of this next Run object to, then empty described temporary storage cell.
If the character content of this next Run object of 1-4-3 is using sentence end mark as ending, the character content then deposited by described temporary storage cell is taken out, before adding the character content of this next Run object to, then empty described temporary storage cell.
The present invention also comprises, and sets up a dictionary object, and the key of described dictionary object is original text, and value is translation, and original text-translation is a key-value pair; When traveling through described interpreter and processing document, original text-translation corresponding in recording, writes described dictionary object respectively.
In step 5) in, if the translation hurdle of an id place record is empty, then in described dictionary object, with the original text of this id place record for key removes to search the translation value whether having coupling, if found, fill translation hurdle with this translation.
If in described dictionary object, do not find the translation value of coupling, then this sentence is that leakage is translated, and directly uses original text to fill, and the personnel that conveniently examine and revise find.
Further, described interpreter being processed before document is issued to interpreter, traveling through described interpreter and processing document, the sentence of repetition is marked, reminding interpreter not need to repeat translation.
Further, described interpreter being processed before document is issued to interpreter, traveling through described interpreter and processing document, sentence in original text and the term in terminology bank are carried out Auto-matching, if coupling, then annotate and comment on this term sentence, make translation more smooth and easy.
Further, described interpreter being processed before document is issued to interpreter, traveling through described interpreter and processing document, the sentence in original text is mated with the language material comparison in corpus one by one, if coupling, be then filled up to the language material translation in corpus in " translation " field corresponding to this coupling sentence.
Beneficial effect: the present invention simplifies the work of translator, makes translator without the need to grasping the use disposal route of all kinds of main flow documentor as PPT, Word, EXCL, PDF, thus more energy can be had to be absorbed in the work of character translation go.In addition, by the manuscript translated needed for preanalysis automatic in processing procedure, the sentence searching out repeatability marks, and the sentence of all repeatability all only needs translation once, other then automatic filling generations; Collect each translation achievement, the information such as the language material accumulating out before can directly using when receiving new contribution and term, further improve translation efficiency.
Accompanying drawing explanation
Below in conjunction with the drawings and specific embodiments, technical scheme of the present invention is further described in detail.
Fig. 1 is that the interpreter of the specific embodiment of the invention translates process interface sectional drawing, and the interpreter having filled original text of the main presentation in figure processes document.
Fig. 2 is that another interpreter of the specific embodiment of the invention translates process interface sectional drawing, and the main presentation in figure one processes document through pretreated interpreter.
Fig. 3 is overall flow figure of the present invention.
Embodiment
The file being beneficial to translation of the present invention's proposition extracts and method of reducing, comprises the following steps:
1) utilize Aspose dynamic link library to the operation support of document process, disassembling into document object to be translated with simple sentence is the set data to be translated of least unit;
2) set up an interpreter and process document, described interpreter processes document and is provided with " original text ", " translation " and id tri-fields, the corresponding sentence original text of described " original text " field, the corresponding sentence translation of " translation " field;
3) each sentence in the described set data to be translated taking simple sentence as least unit is copied to " original text " field that described interpreter processes document in order one by one, then the content of this sentence in the set data to be translated occupy-place symbol Guid that has uniqueness is substituted, and adjacent occupy-place symbol Guid has different character format; The content of described id field has between described different Guid the relation mapped one by one;
4) described interpreter is processed document and be issued to interpreter, described interpreter processes the original text translating " original text " field in document one by one described interpreter, is filled up to corresponding " translation " field, until processed;
5) travel through described set data to be translated and interpreter processes document, the different ids corresponding according to different Guid, find the translation that this id is corresponding, cover the position of writing this corresponding Guid in described set data to be translated.
6) call Aspose dynamic link library, described set reduction data to be translated is generated the translation manuscript that document processing tools identifies.
For more understanding the present invention, below with the process of Word document, be translated as example, describe translation processing procedure of the present invention in detail:
S1, call Aspose assembly;
S2, travel through Word document object to be translated, obtain whole paragraph object, this paragraph object comprises the whole Word message of document object, and does not comprise without the need to the symbol of translation, image or other non-legible information;
S3, travel through the child node object of each paragraph object, thus obtain several character set object Run;
Aspose assembly provides paragraph object, child node object, and facilitates the Run object of character manipulation; Run is to liking the character set that in document, continuous print character format is consistent.
There are 4 kinds of situations in the Run object obtained: 1. a Run object includes multiple complete sentence; 2. a Run object includes multiple complete sentence and some sentence fragments; 3. a Run object only comprises a sentence fragment; 4. Run object comprises a complete sentence.Therefore, need process of further making pauses in reading unpunctuated ancient writings, existing Run object is split and merged, obtain only including one by one the Run object of a complete sentence.
S4, fractionation: travel through each Run object, split into the Run object only including a complete sentence one by one by whole Run object, or for only including the Run object of a sentence fragment.The mode adopted is such as:
From first described Run object, check the character content of this Run object,
If Run object only includes a complete sentence, or a sentence fragment, then directly check next Run object;
If a Run object includes multiple complete sentence, then by this Run object with sentence end mark for boundary, split into several Run object only comprising a complete words.
If a Run object includes multiple complete sentence and a sentence fragment, then by this Run object with sentence end mark for boundary, split into several Run object only comprising a complete words, and a Run object comprising a sentence fragment.
Such as, passage for " in order to solve the problem, special propose a kind of by Word, Excel; the multiple main flow such as PPT, PDF document format is converted into the Word document of unified standard pattern and also can will transforms the standard Word document obtained and be reduced to the method for original copy form conversely.To simplify translation, to improve translation efficiency.”
To above-mentioned passage, application Aspose assembly, after traveling through above-mentioned paragraph object, obtain several character set object Run, order is respectively: Run-1: " in order to solve the problem; special propose a kind of will ", Run-2: " Word; Excel; PPT, PDF ", Run-3: " wait multiple main flow document format be converted into unified standard pattern ", Run-4: " Word ", Run-5: " document and also can will transform the standard obtained conversely ", Run-6: " Word ", Run-7: " document is reduced to the method for original copy form.To simplify translation, to improve translation efficiency.”
Obviously, above-mentioned Run-1 to Run-6, just comprises a sentence fragment, and Run-7 comprises two sentences seeming complete.Generate the data acquisition in units of complete words, Run-1 to Run-6 needs to merge, and Run-7 needs to split further.
S5, merging: travel through each Run object, follow-up only to include the Run object merging only including sentence fragment in the Run object of a complete sentence to it.Specifically comprise the following steps:
The character content that S5-1 will only be the Run object of a sentence fragment is taken out, and leaves temporary storage cell in, then in paragraph object, deletes this Run object;
S5-2 checks next Run object, if the character content of this Run object is only a sentence fragment, then takes out the character content of this Run object, add temporary storage cell to, then in paragraph object, delete this Run object, continue to check next but one Run object; Otherwise, take out temporary storage cell and deposit character content, before adding the character content of this next Run object to, then empty temporary storage cell.
If the character content of this next Run object of S5-3 is using sentence end mark as ending, then the character content deposited by described temporary storage cell is taken out, and before adding the character content of this next Run object to, then empties described temporary storage cell.
After completing, obtaining take sentence as least unit, only to include the set of the Run object of a complete sentence one by one.
S6, set up an interpreter and process document, interpreter processes document and is provided with " original text ", " translation " and id tri-fields, the corresponding sentence original text of " original text " field, the corresponding sentence translation of " translation " field;
S7, by with simple sentence be least unit set data to be translated in each sentence copy to " original text " field that described interpreter processes document in order one by one, then the content of this sentence in the set data to be translated occupy-place symbol Guid that has uniqueness is substituted, and adjacent occupy-place symbol Guid has different character format, such as, give adjacent occupy-place symbol Guid with different colors, make the character format of adjacent Guid different; There is between the content of id field and different Guid the relation mapped one by one;
S8, travel through described interpreter and process document, the sentence of repetition is marked.As shown in Figure 2, " Android integration guide " repeats, then mark with colour band, reminds interpreter not need to repeat translation.
S9, travel through described interpreter and process document, the sentence in original text and the term in terminology bank are carried out Auto-matching, if coupling, then this term sentence is annotated and commented on, make translation more smooth and easy.
S10, travel through described interpreter and process document, the sentence in original text is mated with the language material comparison in corpus one by one, if 100% coupling, then the language material translation in corpus is filled up in " translation " field corresponding to this coupling sentence.As in Fig. 2, there is conventional corresponding translation " SDKdownload " in " SDK download " in language material, and " FAQs " exists conventional corresponding translation " FAQ " in language material.Language material translation in corpus is filled up in " translation " field corresponding to this coupling sentence.If when matching degree is less than 100%, this guild is marked as certain color.This sentence is indicated to need interpreter to translate, but first automatic filling language material translation, for interpreter with reference to, amendment.
S11, interpreter is processed document be issued to interpreter, interpreter processes the original text translating " original text " field in document one by one described interpreter, be filled up to corresponding " translation " field, until processed;
S12, set up a dictionary object, the key of described dictionary object is original text, and value is translation, and original text-translation is a key-value pair; When traveling through described interpreter and processing document, original text-translation corresponding in recording, writes described dictionary object respectively.
S13, being uploaded onto the server as new language material to information by this language material remains, in next translation, and reference.
S14, travel through described set data to be translated and interpreter processes document, the different ids corresponding according to different Guid, find the translation that this id is corresponding, cover the position of writing this corresponding Guid in described set data to be translated.
If the translation hurdle of an id place record is empty, then in dictionary object, with the original text of this id place record for key removes to search the translation value whether having coupling, if found, fill translation hurdle with this translation.
If in dictionary object, do not find the translation value of coupling, then this sentence is that leakage is translated, and directly uses original text to fill, and the personnel that conveniently examine and revise find.
S15, call Aspose dynamic link library, described set data to be translated is reduced into original copy formatted file.
For EXCL, PPT, PDF document that the present invention mentions, those skilled in the art are by reading herein, can by ASpose assembly, realize the character information that these documents comprise, and according to disclosed method, carry out the fractionation of the data acquisition in units of sentence, combined treatment, generate interpreter and process document, be convenient to interpreter's translation; And after interpreter's translation, the reduction treatment of translation manuscript.Such as, for EXCL, PDF, PPT document, after can first utilizing existing instrument to convert corresponding Word document to, utilizing the method process of above-described embodiment.For EXCL document, also can directly by the process of ASpose assembly.
Obviously, the present invention has following beneficial features:
(1) the method is used the manuscript of multiple different-format can be converted into the standardization document of uniform format.Help the standardized management of translation, thus the translation simplifying interpreter improves translation efficiency.
(2) by the confluence analysis with batch all documents, the sentence repeated is found out.Repeat sentence repeatedly without the need to translation repeatedly, only need to translate when first time occurs, the translation of other these sentences of position will be filled automatically.
(3) the new language material information produced in translation process is saved to cloud service.By the continuous accumulation to language material, promote the translation ability of translator itself gradually.By the means that language material is multiplexing, promote translation efficiency.
(4) standard needing the contribution of translation to be all converted to consolidation form can be treated the manuscript of a translation, facilitate the management of team's translation, reduce the work difficulty of translation project manager.
(5) execution efficiency of extraction and reduction logic is higher, does not substantially bring negative effect to translation.
(6) algorithm can be revised as the case may be during reduction, reach the final effect that form retains.
It should be noted last that, above embodiment is only in order to illustrate technical scheme of the present invention and unrestricted, although with reference to preferred embodiment to invention has been detailed description, those of ordinary skill in the art is to be understood that, can modify to technical scheme of the present invention or equivalent replacement, and not departing from the spirit and scope of technical solution of the present invention, it all should be encompassed in the middle of right of the present invention.

Claims (8)

1. the file being beneficial to translation extracts and a method of reducing, it is characterized in that, comprises the following steps:
1) utilize Aspose dynamic link library to the operation support of document process, disassembling into document object to be translated with simple sentence is the set data to be translated of least unit;
2) set up an interpreter and process document, described interpreter processes document and is provided with " original text ", " translation " and id tri-fields, the corresponding sentence original text of described " original text " field, the corresponding sentence translation of " translation " field;
3) each sentence in the described set data to be translated taking simple sentence as least unit is copied to " original text " field that described interpreter processes document in order one by one, then the content of this sentence in the set data to be translated occupy-place symbol Guid that has uniqueness is substituted, and adjacent occupy-place symbol Guid has different character format; The content of described id field has between described different Guid the relation mapped one by one;
4) described interpreter is processed document and be issued to interpreter, described interpreter processes the original text translating " original text " field in document one by one described interpreter, is filled up to corresponding " translation " field, until processed;
5) travel through described set data to be translated and interpreter processes document, the different ids corresponding according to different Guid, find the translation that this id is corresponding, cover the position of writing this corresponding Guid in described set data to be translated;
6) call Aspose dynamic link library, described set reduction data to be translated is generated original copy format file.
2. the file being beneficial to translation according to claim 1 extracts and method of reducing, it is characterized in that, described disassemble into document object to be translated with sentence is the set data to be translated of least unit, comprises the following steps:
1-1 calls Aspose assembly;
1-2 travels through described document object, obtains whole paragraph object, and described paragraph object comprises the whole Word message of document object, and does not comprise symbol, image or other the non-legible information without the need to translating;
1-3 travels through the child node object of each paragraph object, thus obtains several character set object Run;
1-4 travels through each Run object, whole Run object is split into the Run object only including a complete sentence one by one, or for only including the Run object of a sentence fragment;
1-5 travels through each Run object, follow-up only to be included in the Run object of a complete sentence by the Run object merging only including sentence fragment to it.
3. the file being beneficial to translation according to claim 2 extracts and method of reducing, it is characterized in that, describedly will only include the Run object merging of a sentence fragment to follow-up Run object, comprises the following steps:
The character content that 1-4-1 will only be the Run object of a sentence fragment is taken out, and leaves temporary storage cell in, then in paragraph object, deletes this Run object;
1-4-2 checks next Run object, if the character content of this Run object is only a sentence fragment, then takes out the character content of this Run object, add temporary storage cell to, then in paragraph object, delete this Run object, continue to check next but one Run object; Otherwise, take out temporary storage cell and deposit character content, before adding the character content of this next Run object to, then empty described temporary storage cell;
If the character content of this next Run object of 1-4-3 is using sentence end mark as ending, the character content then deposited by described temporary storage cell is taken out, before adding the character content of this next Run object to, then empty described temporary storage cell.
4. the file being beneficial to translation according to claim 1 extracts and method of reducing, and it is characterized in that, also comprise, set up a dictionary object, the key of described dictionary object is original text, and value is translation, and original text-translation is a key-value pair; When traveling through described interpreter and processing document, original text-translation corresponding in recording, writes described dictionary object respectively.
5. the file being beneficial to translation according to claim 4 extracts and method of reducing, it is characterized in that, in step 5) in, if the translation hurdle of an id place record is empty, then in described dictionary object, with the original text of this id place record for key removes to search the translation value whether having coupling, if found, fill translation hurdle with this translation;
If in described dictionary object, do not find the translation value of coupling, then this sentence is that leakage is translated, and directly uses original text to fill.
6. the file being beneficial to translation according to claim 1 extracts and method of reducing, it is characterized in that, described interpreter being processed before document is issued to interpreter, traveling through described interpreter and processing document, the sentence of repetition is marked, reminds interpreter not need to repeat translation.
7. the file being beneficial to translation according to claim 1 extracts and method of reducing, it is characterized in that, described interpreter is being processed before document is issued to interpreter, travel through described interpreter and process document, sentence in original text and the term in terminology bank are carried out Auto-matching, if coupling, then annotate and comment on this term sentence, make translation more smooth and easy.
8. the file being beneficial to translation according to claim 1 extracts and method of reducing, it is characterized in that, described interpreter is being processed before document is issued to interpreter, travel through described interpreter and process document, sentence in original text is mated with the language material comparison in corpus one by one, if coupling, be then filled up to the language material translation in corpus in " translation " field corresponding to this coupling sentence.
CN201510357672.2A 2015-06-25 2015-06-25 A kind of file beneficial to translation is extracted and restoring method Active CN104933041B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201510357672.2A CN104933041B (en) 2015-06-25 2015-06-25 A kind of file beneficial to translation is extracted and restoring method
PCT/CN2015/098668 WO2016206336A1 (en) 2015-06-25 2015-12-24 File extraction and restoration method favorable for translation work

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510357672.2A CN104933041B (en) 2015-06-25 2015-06-25 A kind of file beneficial to translation is extracted and restoring method

Publications (2)

Publication Number Publication Date
CN104933041A true CN104933041A (en) 2015-09-23
CN104933041B CN104933041B (en) 2017-09-01

Family

ID=54120210

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510357672.2A Active CN104933041B (en) 2015-06-25 2015-06-25 A kind of file beneficial to translation is extracted and restoring method

Country Status (2)

Country Link
CN (1) CN104933041B (en)
WO (1) WO2016206336A1 (en)

Cited By (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105677643A (en) * 2016-03-14 2016-06-15 张广睿 Translation method combining manpower and machine
CN105760368A (en) * 2016-03-11 2016-07-13 张广睿 Deep processing method for characters of document
CN105808528A (en) * 2016-03-04 2016-07-27 张广睿 Document character processing method
CN105975451A (en) * 2016-05-27 2016-09-28 成都优译信息技术有限公司 Processing system and method for DWG-format-file translation data
CN105975461A (en) * 2016-05-27 2016-09-28 成都优译信息技术有限公司 Method for newly adding translation in DWG format file
CN106021242A (en) * 2016-05-27 2016-10-12 成都优译信息技术有限公司 Write-back system of drawing interpretation data in DWG format and write-back method thereof
CN106021197A (en) * 2016-05-27 2016-10-12 成都优译信息技术有限公司 Translation system of DWG-format files and translation method
CN106055529A (en) * 2016-05-27 2016-10-26 成都优译信息技术有限公司 Analytic system of text data to be translated in DWG-format files and analytic method thereof
WO2016206336A1 (en) * 2015-06-25 2016-12-29 武汉传神信息技术有限公司 File extraction and restoration method favorable for translation work
CN106919558A (en) * 2015-12-24 2017-07-04 姚珍强 For the interpretation method and translating equipment based on natural dialogue mode of mobile device
CN107590140A (en) * 2017-10-17 2018-01-16 语联网(武汉)信息技术有限公司 Entry process method is translated in a kind of document leakage
CN107885735A (en) * 2017-11-21 2018-04-06 语联网(武汉)信息技术有限公司 A kind of unrelated document translation method and system of form
CN108563645A (en) * 2018-04-24 2018-09-21 成都智信电子技术有限公司 The metadata interpretation method and device of HIS systems
CN109446531A (en) * 2018-09-06 2019-03-08 语联网(武汉)信息技术有限公司 Detect the method, apparatus and electronic equipment of translation progress
CN109783826A (en) * 2019-01-15 2019-05-21 四川译讯信息科技有限公司 A kind of document automatic translating method
CN111143074A (en) * 2019-12-30 2020-05-12 文思海辉智科科技有限公司 Method and device for distributing translation files
CN111144070A (en) * 2019-12-31 2020-05-12 北京迈迪培尔信息技术有限公司 Document parsing translation method and device
CN111291575A (en) * 2020-02-28 2020-06-16 北京字节跳动网络技术有限公司 Text processing method and device, electronic equipment and storage medium
CN112052648A (en) * 2020-09-02 2020-12-08 文思海辉智科科技有限公司 String translation method and device, electronic equipment and storage medium
CN112766003A (en) * 2021-01-20 2021-05-07 语联网(武汉)信息技术有限公司 Document auxiliary translation method and device
CN113705158A (en) * 2021-09-26 2021-11-26 上海一者信息科技有限公司 Method for intelligently restoring original text style in document translation

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110555196B (en) * 2018-05-30 2023-07-18 北京百度网讯科技有限公司 Method, apparatus, device and storage medium for automatically generating article
CN109617974B (en) * 2018-12-21 2021-12-28 珠海金山办公软件有限公司 Request processing method and device and server
CN110688863B (en) * 2019-09-25 2023-04-07 六维联合信息科技(北京)有限公司 Document translation system and document translation method

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5848386A (en) * 1996-05-28 1998-12-08 Ricoh Company, Ltd. Method and system for translating documents using different translation resources for different portions of the documents
CN104331399A (en) * 2014-07-25 2015-02-04 一朵云(北京)科技有限公司 Dictionary tree translation method

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102982027A (en) * 2011-09-02 2013-03-20 北大方正集团有限公司 Method and device for abstracting contents in document
CN104933041B (en) * 2015-06-25 2017-09-01 武汉传神信息技术有限公司 A kind of file beneficial to translation is extracted and restoring method

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5848386A (en) * 1996-05-28 1998-12-08 Ricoh Company, Ltd. Method and system for translating documents using different translation resources for different portions of the documents
CN104331399A (en) * 2014-07-25 2015-02-04 一朵云(北京)科技有限公司 Dictionary tree translation method

Cited By (33)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2016206336A1 (en) * 2015-06-25 2016-12-29 武汉传神信息技术有限公司 File extraction and restoration method favorable for translation work
CN106919558A (en) * 2015-12-24 2017-07-04 姚珍强 For the interpretation method and translating equipment based on natural dialogue mode of mobile device
CN105808528A (en) * 2016-03-04 2016-07-27 张广睿 Document character processing method
CN105808528B (en) * 2016-03-04 2019-01-25 张广睿 A kind of processing method of document text
CN105760368A (en) * 2016-03-11 2016-07-13 张广睿 Deep processing method for characters of document
CN105760368B (en) * 2016-03-11 2019-02-12 张广睿 A kind of deep treatment method of document text
CN105677643A (en) * 2016-03-14 2016-06-15 张广睿 Translation method combining manpower and machine
CN106055529B (en) * 2016-05-27 2019-04-23 成都优译信息技术有限公司 The resolution system and its analytic method of text data to be translated in DWG formatted file
CN106021242B (en) * 2016-05-27 2019-04-23 成都优译信息技术有限公司 DWG format drawing interpretation write back data system and its write-back method
CN106021197A (en) * 2016-05-27 2016-10-12 成都优译信息技术有限公司 Translation system of DWG-format files and translation method
CN106055529A (en) * 2016-05-27 2016-10-26 成都优译信息技术有限公司 Analytic system of text data to be translated in DWG-format files and analytic method thereof
CN106021197B (en) * 2016-05-27 2019-04-23 成都优译信息技术有限公司 The translation system and interpretation method of DWG formatted file
CN105975451B (en) * 2016-05-27 2019-04-23 成都优译信息技术有限公司 The processing system and its processing method of DWG formatted file translation data
CN106021242A (en) * 2016-05-27 2016-10-12 成都优译信息技术有限公司 Write-back system of drawing interpretation data in DWG format and write-back method thereof
CN105975461A (en) * 2016-05-27 2016-09-28 成都优译信息技术有限公司 Method for newly adding translation in DWG format file
CN105975461B (en) * 2016-05-27 2019-04-23 成都优译信息技术有限公司 Increase the method for translation newly in DWG formatted file
CN105975451A (en) * 2016-05-27 2016-09-28 成都优译信息技术有限公司 Processing system and method for DWG-format-file translation data
CN107590140B (en) * 2017-10-17 2020-09-25 语联网(武汉)信息技术有限公司 Document missing item processing method
CN107590140A (en) * 2017-10-17 2018-01-16 语联网(武汉)信息技术有限公司 Entry process method is translated in a kind of document leakage
CN107885735A (en) * 2017-11-21 2018-04-06 语联网(武汉)信息技术有限公司 A kind of unrelated document translation method and system of form
CN107885735B (en) * 2017-11-21 2021-05-04 语联网(武汉)信息技术有限公司 Format-independent document translation method and system
CN108563645A (en) * 2018-04-24 2018-09-21 成都智信电子技术有限公司 The metadata interpretation method and device of HIS systems
CN108563645B (en) * 2018-04-24 2022-03-22 成都智信电子技术有限公司 Metadata translation method and device of HIS (hardware-in-the-system)
CN109446531A (en) * 2018-09-06 2019-03-08 语联网(武汉)信息技术有限公司 Detect the method, apparatus and electronic equipment of translation progress
CN109783826A (en) * 2019-01-15 2019-05-21 四川译讯信息科技有限公司 A kind of document automatic translating method
CN109783826B (en) * 2019-01-15 2023-11-21 四川译讯信息科技有限公司 Automatic document translation method
CN111143074A (en) * 2019-12-30 2020-05-12 文思海辉智科科技有限公司 Method and device for distributing translation files
CN111143074B (en) * 2019-12-30 2024-04-09 文思海辉智科科技有限公司 Method and device for distributing translation files
CN111144070A (en) * 2019-12-31 2020-05-12 北京迈迪培尔信息技术有限公司 Document parsing translation method and device
CN111291575A (en) * 2020-02-28 2020-06-16 北京字节跳动网络技术有限公司 Text processing method and device, electronic equipment and storage medium
CN112052648A (en) * 2020-09-02 2020-12-08 文思海辉智科科技有限公司 String translation method and device, electronic equipment and storage medium
CN112766003A (en) * 2021-01-20 2021-05-07 语联网(武汉)信息技术有限公司 Document auxiliary translation method and device
CN113705158A (en) * 2021-09-26 2021-11-26 上海一者信息科技有限公司 Method for intelligently restoring original text style in document translation

Also Published As

Publication number Publication date
CN104933041B (en) 2017-09-01
WO2016206336A1 (en) 2016-12-29

Similar Documents

Publication Publication Date Title
CN104933041A (en) File extraction and reduction method favorable for translation work
CN105138350B (en) A kind of configuration software internationalizes multilingual implementation method
KR20150036041A (en) Phrase-based dictionary extraction and translation quality evaluation
McCarthy et al. Marrying universal dependencies and universal morphology
CN105808528A (en) Document character processing method
CN111178088A (en) Configurable neural machine translation method oriented to XML document
Pradet et al. Adapting VerbNet to French using existing resources
CN104331400B (en) A kind of Mongolian code conversion method and device
CN113867694B (en) Method and system for intelligently generating front-end code
CN106156013A (en) The two-part machine translation method that a kind of regular collocation type phrase is preferential
CN109947711B (en) Automatic multi-language file management method in IOS project development process
CN102629244B (en) Multi-language work card generating system and method
CN106156007A (en) A kind of English-Chinese statistical machine translation method of word original shape
Modh et al. A STUDY OF MACHINE TRANSLATION APPROACHES FOR GUJARATI
Mohammed et al. English to Arabic machine translation based on reordring algorithm
CN105893056A (en) Template and template logic combining method and system
Alkazemi et al. AccurIT: a prototype of a machine translation engine for English to Arabic translation
Tufis et al. Parallel corpora, alignment technologies and further prospects in multilingual resources and technology infrastructure
CN111368091A (en) Document translation method and device
Raiss et al. ANMorph: Amazigh nouns morphological analyzer
Blahuš et al. Extending Czech WordNet using a bilingual dictionary
Zaki et al. A survey of syntactic parsers of Arabic language
Zhang Development of Computer Intelligent Proofreading System from the Perspective of Medical English Translation Application
CN114548127A (en) Mixed neural machine translation method and system for new and old Uyghur characters
Zimmermann et al. Patterns for Tailoring E-Learning Materials to Make them Suited for Changed Requirement

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CP03 Change of name, title or address

Address after: 430074 East Lake Development Zone, Wuhan, Hubei Province, Optics Valley Software Park in the west, south of Nanhu South Road, Optics Valley Software Park, 2 5 layers 205

Patentee after: Language network (Wuhan) Information Technology Co., Ltd.

Address before: 430074, Optics Valley Software Park, East Lake Development Zone, Wuhan, south of Hubei, South Lake Road, Optics Valley Software Park, 2, six, 5, No. 205

Patentee before: Wuhan Transn Information Technology Co., Ltd.

CP03 Change of name, title or address