ย้ายบล็อกไปที่ bact.cc แล้วนะครับ

พ.ร.บ.คอมพิวเตอร์
หยุด ร่างพ.ร.บ.คอมพิวเตอร์
พื้นที่เก็บข้อมูลออนไลน์ ฟรี 2GB จาก Dropbox (sync กับ Windows, Linux, Mac, iPhone, Android ฯลฯ ได้)

2004-06-19

Penn Treebank in XML format

Vee, you may interest in this one.

No instant noodle here. Still have to work something out, but it shouldn't be a difficult task.

TIGERRegistry PTB -> TIGER XML filter, included in TIGERSearch, a treebank explorer package.
TIGER API, Java lib for accessing corpus in TIGER XML format.

3 comments:

Anonymous said...

Did it work out?

BTW, Tiger API has moved. This is the new URL: TIGER API.

We have also included a section describing how to access corpora encoded in Penn Treebank format and other formats.

oeze, one of the authors

bact' said...

no news from his (vee) side,
so I think he didn't tried that yet
(.. actually, I'm not sure even he has an access to Penn Treebank).

bact' said...

but, anyway, thanks for the software :)
(and it's a GPL as well, coolx2 ! :D)