dhlab.text.chunking#

Module Contents#

Classes#

Chunks

Create chunks from a text.

API#

class dhlab.text.chunking.Chunks(urn=None, chunks=1000)#

Create chunks from a text.

Initialization

Parameters:
  • urn – str or list

  • chunks – {‘para’, ‘avsn’} or int

to_pandas()#

Vectorize into a pandas dataframe with words a index