
Chunks

from dhlab.text.chunking import Chunks
class Chunks(urn=None, chunks=1000)

Bases: object

Create chunks from a text.

Parameters:
  • urn – str or list

  • chunks – {'para', 'avsn'} or int
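
To make the chunks parameter more concrete, here is a minimal standard-library sketch of what splitting a text by a fixed word count versus by paragraph could look like. It does not call dhlab, and it is not the library's implementation: chunk_text is a hypothetical helper, the blank-line paragraph rule is an assumption, and 'avsn' is left out because this page does not define it.

```python
import re


def chunk_text(text, chunks=1000):
    """Hypothetical helper (not part of dhlab): split text into lists of words."""
    if isinstance(chunks, int) and chunks > 0:
        # Fixed-size chunks: every `chunks` words form one chunk.
        words = text.split()
        return [words[i:i + chunks] for i in range(0, len(words), chunks)]
    if chunks == "para":
        # Assumption: paragraphs are separated by one or more blank lines.
        paragraphs = re.split(r"\n\s*\n", text.strip())
        return [p.split() for p in paragraphs if p.strip()]
    raise ValueError(f"unsupported chunks value: {chunks!r}")


sample = "one two three four five\n\nsix seven"
by_size = chunk_text(sample, chunks=3)
by_para = chunk_text(sample, chunks="para")
```

With an integer, the last chunk may be shorter than the requested size; with 'para', chunk lengths follow the text's own paragraph breaks.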

to_pandas()

Vectorize into a pandas DataFrame with words as the index.
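
As an illustration of the shape such a result might take, the sketch below builds a word-by-chunk count table with plain pandas. It is an assumption about the layout (words as the index, one column per chunk) and does not reproduce what to_pandas() does internally; chunk_words is made-up sample data.

```python
from collections import Counter

import pandas as pd

# Made-up sample data: each inner list holds the words of one chunk.
chunk_words = [
    ["the", "cat", "sat"],
    ["the", "dog", "the"],
]

# Count the words in each chunk, then build one row per chunk.
counts = [dict(Counter(words)) for words in chunk_words]

# Transpose so words become the index and chunk numbers the columns;
# words missing from a chunk get a count of 0.
frame = pd.DataFrame(counts).T.fillna(0).astype(int)
```

Each column of frame can then be read as a bag-of-words vector for one chunk.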