tag:blogger.com,1999:blog-6905541471605961414.post6651570025544986334..comments2024-03-28T17:25:17.827+07:00Comments on Thai 101: Slides from my conference talkRikkerhttp://www.blogger.com/profile/17196282287835224940noreply@blogger.comBlogger3125tag:blogger.com,1999:blog-6905541471605961414.post-62437833565360283232008-07-10T14:10:00.000+07:002008-07-10T14:10:00.000+07:00Thanks for posting this and all your effort on the...Thanks for posting this and all your effort on the blog.xJChttps://www.blogger.com/profile/00380320419195061742noreply@blogger.comtag:blogger.com,1999:blog-6905541471605961414.post-92074879425569042412008-07-08T05:38:00.000+07:002008-07-08T05:38:00.000+07:00This example is from the "fixed" corpus, not the w...This example is from the <A HREF="http://sealang.net/thai/corpus.htm" REL="nofollow">"fixed" corpus</A>, not the <A HREF="http://sealang.net/webcorpus/thai" REL="nofollow">web-based corpus</A>, by the way. That's why you're getting so few results. There are a number of different sub-corpora you can select from at the bottom of the left-hand control panel, the default being a corpus of news stories.<BR/><BR/>So anyway, this shows the collocates (aka the "word neighbors"), or which other words commonly appear with the phrase you searched. A leading collocate means a word which comes before your phrase, and trailing collocate is one that comes after.<BR/><BR/>Here the underscore _ means a space--so in this case, the "collocate" is really just a space, which is to say that in 2 instances, or 50% of the time (like I said, small corpus, which is why 2 hits is 50%), a space immediately follows the phrase you searched.<BR/><BR/>Next it gives you the instances it found. It pulls out a predefined character-size sample, which you can control with the "context size" box on the left. It doesn't check to see where words/sentences begin or end, but just pulls out the word and collocates in their raw context.<BR/><BR/>In this case it looks like it amounts to about 30 characters of text on either side of your search phrase.<BR/><BR/>As with pretty much anything on SEAlang, there's way more functionality than there is polish. <BR/><BR/>In the near future there will start to be screen casts on the site to demonstrate how to use some of the basic tools and advanced features.Rikkerhttps://www.blogger.com/profile/17196282287835224940noreply@blogger.comtag:blogger.com,1999:blog-6905541471605961414.post-50365079634972352142008-07-07T23:16:00.000+07:002008-07-07T23:16:00.000+07:00Rikker,Here is a sample from the "Corpus".(2 / 50%...Rikker,<BR/><BR/>Here is a sample from the "Corpus".<BR/><BR/>(2 / 50%) ตามสัญชาตญาณ_ อาชีพ จึงหยุดรถคว้ากล้องลงไปดู ตามสัญชาตญาณ_ > ไม่เพียงแต่คนขับสามล้อเท่านั้ ว์นี่ไม่ได้เป็นวัฒนธรรม เขากิน ตามสัญชาตญาณ_ > ไม่มีว่านกจับหนอนแล้วไปโยนใส่ <BR/><BR/>How do I read this? Thanks for all your very hard work in this project.Davidhttps://www.blogger.com/profile/17698125439932656082noreply@blogger.com