资源描述:
《Text Analysis of Patent Abstracts》由会员上传分享,免费在线阅读,更多相关内容在学术论文-天天文库。
1、TextAnalysisofPatentAbstractsYvonneTsai,NationalTaiwanUniversityABSTRACTTextanalysisinvolvesthedeconstructionofinformationwithinatext.Thisincludestextstructure,textpattern,linguisticfeatures,lexicalanalysis,andsyntacticanalysis.Thisresearchtookasitsstartingpointthebottom-upapproachofanalysingthelexi
2、calfeatures,syntacticfeatures,andtextualfeaturesofpatentabstractsforcomprehensivecoverageoftextanalysis.Severaltoolshavebeenappliedintheanalysisofpatentabstracts.Thisthree-foldanalysisoftextoutlinedaboveembracesinformationonsentencestatistics,segmentationstatistics,wordfrequencies,lexicaldensities,a
3、ndreadabilitylevels.ItwasfoundthatEnglishtranslatedtextspresentedamoreconsistentuseofshortsentencesthanintheoriginalChinesetexts,andacommonusageofshorterwordswasalsoevidentinthetranslatedtexts.Whileshortsentences,shortwordlength,andhighrepetitionsofwordscharacterisedtextswitheasyreadability,findings
4、fromthereadabilitytestsindicatedthatinordertounderstandpatentabstractswithoutdifficulty,readersshouldhavereceivedatleast14yearsofeducation.KEYWORDSPatentabstract,textanalysis,readability,syntacticanalysis,lexicalanalysis,lexicaldensity,segmentation.IntroductionTherearemanywaystoanalysetexts:contenta
5、nalysis,textualanalysis,andtextanalytics.Theseinter-relatedtermsandconceptsinvolvesystematicapproachesindeconstructinginformationwithinatext.Theinformationforanalysisusuallyincludestextstructure,textpattern,linguisticfeatures,lexicalanalysis,andsyntacticanalysis.Thestructureofatextrelatestotextpatte
6、rnandlinguisticfeatures,whilelinguisticfeaturesprovidelexicalandsyntacticproperties.Thisresearchbeganfromabottom-upapproachbyanalysingthelexicalfeatures,syntacticfeatures,andtextualfeaturesofpatentabstractsforcomprehensivecoverageoftextanalysis.Thisthree-foldanalysisoftextfurtherembracedinformationo
7、nsentencestatistics,segmentationstatistics,wordfrequencies,lexicaldensities,andreadabilitylevels.AsOlohanpointsout,“data-basedordata-drivenanalysisofthetext[…]isclearlyalignedwiththedescriptiveperspec