File size: 3,544 Bytes
d916065
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
.. Copyright (C) 2001-2016 NLTK Project
.. For license information, see LICENSE.TXT

==================================
Concordance Example
==================================

A concordance view shows us every occurrence of a given
word, together with some context. Here we look up the word monstrous
in Moby Dick by entering text1 followed by a period, then the term
concordance, and then placing "monstrous" in parentheses:

>>> from nltk.corpus import gutenberg
>>> from nltk.text import Text
>>> corpus = gutenberg.words('melville-moby_dick.txt')
>>> text = Text(corpus)

>>> text.concordance("monstrous")
Displaying 11 of 11 matches:
ong the former , one was of a most monstrous size . ... This came towards us ,
ON OF THE PSALMS . " Touching that monstrous bulk of the whale or ork we have r

ll over with a heathenish array of monstrous clubs and spears . Some were thick

d as you gazed , and wondered what monstrous cannibal and savage could ever hav

that has survived the flood ; most monstrous and most mountainous ! That Himmal

they might scout at Moby Dick as a monstrous fable , or still worse and more de

th of Radney .'" CHAPTER 55 Of the Monstrous Pictures of Whales . I shall ere l
ing Scenes . In connexion with the monstrous pictures of whales , I am strongly
ere to enter upon those still more monstrous stories of them which are to be fo
ght have been rummaged out of this monstrous cabinet there is no telling . But
of Whale - Bones ; for Whales of a monstrous size are oftentimes cast up dead u

>>> text.concordance("monstrous")
Displaying 11 of 11 matches:
ong the former , one was of a most monstrous size . ... This came towards us ,
ON OF THE PSALMS . " Touching that monstrous bulk of the whale or ork we have r

ll over with a heathenish array of monstrous clubs and spears . Some were thick

...



We can also search for a multi-word phrase by passing a list of strings:



>>> text.concordance(["monstrous", "size"])

Displaying 2 of 2 matches:

the former , one was of a most monstrous size . ... This came towards us , op

Whale - Bones ; for Whales of a monstrous size are oftentimes cast up dead upo



=================================

Concordance List

=================================



Often we need to store the results of concordance for further usage.

To do so, call the concordance function with the stdout argument set

to false:



>>> from nltk.corpus import gutenberg

>>> from nltk.text import Text

>>> corpus = gutenberg.words('melville-moby_dick.txt')

>>> text = Text(corpus)

>>> con_list = text.concordance_list("monstrous")

>>> con_list[2].line

'll over with a heathenish array of monstrous clubs and spears . Some were thick'

>>> len(con_list)

11



=================================

Patching Issue #2088

=================================



Patching https://github.com/nltk/nltk/issues/2088

The left slice of the left context should be clip to 0 if the `i-context` < 0.



>>> from nltk import Text, word_tokenize

>>> jane_eyre = 'Chapter 1\nTHERE was no possibility of taking a walk that day. We had been wandering, indeed, in the leafless shrubbery an hour in the morning; but since dinner (Mrs. Reed, when there was no company, dined early) the cold winter wind had brought with it clouds so sombre, and a rain so penetrating, that further outdoor exercise was now out of the question.'

>>> text = Text(word_tokenize(jane_eyre))

>>> text.concordance_list('taking')[0].left

['Chapter', '1', 'THERE', 'was', 'no', 'possibility', 'of']