{"id":2507,"date":"2025-03-14T17:44:09","date_gmt":"2025-03-14T16:44:09","guid":{"rendered":"https:\/\/presto.digiamo.cz\/presto\/?p=2507"},"modified":"2025-11-27T19:44:14","modified_gmt":"2025-11-27T18:44:14","slug":"jazykovy-korpus","status":"publish","type":"post","link":"https:\/\/www.presto.cz\/cz\/jazykovy-korpus","title":{"rendered":"Language corpus"},"content":{"rendered":"\n<p><strong>Do you want to know what a language corpus is for, who uses it, and for what purpose? Would you like to learn more about the Czech National Corpus and corpus linguistics? This article introduces the topic.<\/strong><\/p>\n\n\n\n<p><strong>In this article you will learn:<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>What a language corpus is.<\/li>\n\n\n\n<li>The purpose of a language corpus.<\/li>\n\n\n\n<li>Who uses a language corpus.<\/li>\n\n\n\n<li>The Czech National Corpus (\u010cNK).<\/li>\n\n\n\n<li>Corpus linguistics.<\/li>\n\n\n\n<li>The Czech Language Institute (\u00daJ\u010c).<\/li>\n<\/ul>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>What is a language corpus?<\/strong><\/h2>\n\n\n\n<p>A language corpus is a large <strong>collection of authentic texts<\/strong> in a given language, both written and spoken, that have been <strong>converted into electronic form<\/strong>. The electronic form makes it easier not only to collect the data but also to search it. 
The texts are always stored as a single, uniformly formatted collection so that <strong>specific linguistic phenomena<\/strong>, especially lexical ones (words and multi-word expressions), can be searched easily.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>The purpose of a language corpus<\/strong><\/h2>\n\n\n\n<p>A language corpus serves primarily for linguistic research into actual language use, and as a data source for building explanatory dictionaries, multilingual dictionaries, machine translation tools, and automatic proofreading tools. A corpus displays the queried <strong>linguistic phenomena in their natural context<\/strong>, which helps researchers determine how frequently a given phenomenon occurs in the original sources and tells them how words are classified into various categories.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>Who uses a language corpus<\/strong><\/h2>\n\n\n\n<p>Language corpora are used <strong>primarily by linguists<\/strong> in their research, but this lexicological tool is also used by specialists in other fields, for example:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Sociologists and sociolinguists<\/li>\n\n\n\n<li>Editors<\/li>\n\n\n\n<li><a href=\"https:\/\/www.presto.cz\/cz\/dotaznik-prekladatele\" data-type=\"link\" 
data-id=\"https:\/\/www.presto.cz\/cz\/dotaznik-prekladatele\"><strong>Translators<\/strong><\/a><\/li>\n\n\n\n<li>Psychologists<\/li>\n\n\n\n<li>Teachers<\/li>\n\n\n\n<li>Foreign-language students<\/li>\n\n\n\n<li>Textbook authors<\/li>\n<\/ul>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>Czech National Corpus (\u010cNK)<\/strong><\/h2>\n\n\n\n<p>The Czech National Corpus is an academic project built by the <strong>Institute of the Czech National Corpus<\/strong> (\u00da\u010cNK), which was founded in 1994 at the <a href=\"https:\/\/www.ff.cuni.cz\/\"><strong>Faculty of Arts, Charles University<\/strong><\/a> by the Czech linguist Franti\u0161ek \u010cerm\u00e1k. Besides building the corpus, the \u00da\u010cNK is responsible for its further development, for teaching activities, and for cultivating corpus linguistics as a discipline.<\/p>\n\n\n\n<p>The aim of the \u010cNK is to map the Czech language systematically, along with other languages in comparison with Czech. The \u010cNK corpora contain <strong>over 4 billion words of contemporary written language<\/strong> and over 7 million words of spoken language, and they also include older texts and translations into and from 30 foreign languages. 
Anyone interested in the Czech language can register with the \u010cNK system free of charge.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>Corpus linguistics<\/strong><\/h2>\n\n\n\n<p>Corpus linguistics is a branch of linguistics concerned with <strong>studying language by means of language corpora<\/strong>, <strong>processing corpora<\/strong>, building them, and the associated methodology. The growth of the field is closely tied to the development of modern information technology, which makes it a relatively young discipline: this technology allows language data to be processed in ways that would not be feasible by hand.<\/p>\n\n\n\n<p>Corpus linguistics emerged in the <strong>1950s<\/strong> at the instigation of American linguists who recognized how important a language corpus is for describing the grammar of a natural language. In 1967 the Czech-born linguist Henry Ku\u010dera began working with an American colleague in the USA on a computer-based project on contemporary American English. 
In the Czech Republic, the emergence of a language corpus is tied to the founding of the <strong>Institute of the Czech National Corpus<\/strong> mentioned above.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>Czech Language Institute (\u00daJ\u010c)<\/strong><\/h2>\n\n\n\n<p>The Czech Language Institute (\u00dastav pro jazyk \u010desk\u00fd) is one of the institutes of the Czech Academy of Sciences. It conducts <strong>scientific research into both standard and non-standard <a href=\"https:\/\/www.presto.cz\/cz\/cestina\">Czech<\/a><\/strong>, covering its current state, its historical development, and its relationship to other languages. The institute's research focuses mainly on the following areas:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Vocabulary<\/li>\n\n\n\n<li>Grammatical structure<\/li>\n\n\n\n<li>Text structure and stylistic differentiation<\/li>\n\n\n\n<li>Language didactics<\/li>\n\n\n\n<li>General linguistics<\/li>\n<\/ul>\n\n\n\n<p>Drawing on its research, the \u00daJ\u010c provides the public with <strong>advice on the Czech language<\/strong>. Its findings are made available online as well as in print, and they are used not only in <strong>school and out-of-school education<\/strong> but also in commercial practice. 
The \u00daJ\u010c also provides <strong>specialized language courses<\/strong>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Do you want to know what a language corpus is for, who uses it, and for what purpose? Would you like to learn more about the Czech National Corpus and corpus linguistics? This article introduces the topic. In this article you will learn: What is a language corpus? A language corpus is a large collection of authentic texts in a given language, both written and spoken, that have been converted into [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":3038,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[22],"tags":[],"class_list":["post-2507","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-slovnik-pojmu"],"meta_box":[],"_links":{"self":[{"href":"https:\/\/www.presto.cz\/cz\/wp-json\/wp\/v2\/posts\/2507","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.presto.cz\/cz\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.presto.cz\/cz\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.presto.cz\/cz\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.presto.cz\/cz\/wp-json\/wp\/v2\/comments?post=2507"}],"version-history":[{"count":3,"href":"https:\/\/www.presto.cz\/cz\/wp-json\/wp\/v2\/posts\/2507\/revisions"}],"predecessor-version":[{"id":9154,"href":"https:\/\/www.presto.cz\/cz\/wp-json\/wp\/v2\/posts\/2507\/revisions\/915
4"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.presto.cz\/cz\/wp-json\/wp\/v2\/media\/3038"}],"wp:attachment":[{"href":"https:\/\/www.presto.cz\/cz\/wp-json\/wp\/v2\/media?parent=2507"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.presto.cz\/cz\/wp-json\/wp\/v2\/categories?post=2507"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.presto.cz\/cz\/wp-json\/wp\/v2\/tags?post=2507"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}