home << dhlab reference << dhlab.text.corpus_collection

CorpusCollection#

from dhlab.text.corpus_collection import CorpusCollection
class CorpusCollection(corpora=None)[source]#

Bases: object

A class for handling a collection of corpora.

Initialize the class with a dictionary of corpora.

add(name, corpus)[source]#

Add a corpus to the collection.

concat_corpora()[source]#

Concatenate all corpora in the collection into a single corpus.

get(name)[source]#

Get a corpus by name.

remove(name)[source]#

Remove a corpus from the collection.

show_corpora()[source]#

Show the corpora in the collection.