python - NLTK Relation Extraction - custom corpus in relextract.extract_rels
I learnt that there is a built-in function in NLTK to extract relations from NER-tagged sentences, according to the following:

    import re
    from nltk.corpus import ieer
    from nltk.sem import relextract

    # "in" as the filler between the two entities, unless a gerund follows it
    IN = re.compile(r'.*\bin\b(?!\b.+ing\b)')

    for fileid in ieer.fileids():
        for doc in ieer.parsed_docs(fileid):
            for rel in relextract.extract_rels('ORG', 'LOC', doc, corpus='ieer', pattern=IN):
                print(relextract.rtuple(rel))

It seems promising to me for general purposes, but I understood that relextract.extract_rels accepts only 'ieer' or 'conll2002' for the corpus parameter. In that case, its usage is restricted to these two corpora, isn't it? How can one use it with one's own corpus (presuming, of course, that it is NER-tagged)?

This should be more of a comment, but I don't have enough reputation. You can pass a custom corpus as the doc argument after it has been POS-tagged and converted to a list of chunked trees. For a custom corpus, you should use corpus='ace'. For example, in this answer, extract_rels is used to tag a custom corpus.
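To make the corpus='ace' suggestion concrete, here is a minimal sketch. The sentence, the entity names and the variable names are made up for illustration, and the tree is built by hand so that no corpus or model download is needed. It assumes the chunk labels use the long ACE-style names (ORGANIZATION, LOCATION, PERSON), which extract_rels expands from the short forms 'ORG' and 'LOC'.

```python
import re

from nltk.sem import relextract
from nltk.tree import Tree

# Hypothetical stand-in for one sentence of a custom corpus: already
# POS-tagged, with named entities chunked into labelled subtrees.
sentence = Tree("S", [
    Tree("ORGANIZATION", [("Acme", "NNP"), ("Corp", "NNP")]),
    ("in", "IN"),
    Tree("LOCATION", [("Berlin", "NNP")]),
    ("hired", "VBD"),
    Tree("PERSON", [("Alice", "NNP")]),
])

# Same pattern as in the question: "in" not followed by a gerund.
IN = re.compile(r'.*\bin\b(?!\b.+ing\b)')

# corpus='ace' makes extract_rels treat doc as a plain chunked tree.
rels = relextract.extract_rels("ORG", "LOC", sentence, corpus="ace", pattern=IN)
for rel in rels:
    print(relextract.rtuple(rel))
```

For real text, trees of this shape could presumably be produced with nltk.ne_chunk(nltk.pos_tag(nltk.word_tokenize(text))), which requires the corresponding NLTK models to be downloaded. Note that ne_chunk often labels places as GPE rather than LOCATION, so the object class may need to be 'GPE' in that case. The example sentence deliberately contains three entities, because at least some NLTK versions seem to return no relations for sentences with fewer than three.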