Jeffrey C. Witt (Loyola University Maryland)
https://jeffreycwitt.com | jcwitt@loyola.edu
@jeffreycwitt
June 06, 2023, DHSI, University of Victoria, Victoria, BC, Canada
| Paragraph | 4gram1 | 4gram2 | 4gram3 | 4gram4 | 4gram5 | 4gram6 | 4gram7 | 4gram8 | 4gram9 | 4gram10 | 4gram11 | 4gram12 | sum |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Doc A | 1 | 1 | 0 | 1 | 0 | 1 | 1 | 1 | 0 | 0 | 1 | 1 | 8 |
| Doc B | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 10 |
| A * B (Dot Product Vector) | 1x1=1 | 1x1=1 | 0x1=0 | 1x0=0 | 0x1=0 | 1x1=1 | 1x1=1 | 1x1=1 | 0x0=0 | 0x1=0 | 1x1=1 | 1x1=1 | 7 |
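The dot product can be recomputed directly from the Doc A and Doc B rows. The following is a minimal sketch in plain Python; the names `doc_a`, `doc_b`, and `dot_product` are illustrative and not taken from the slides. Because the vectors are binary, the result counts the 4-grams present in both paragraphs.

```python
# Binary 4-gram presence vectors, copied from the Doc A and Doc B rows.
doc_a = [1, 1, 0, 1, 0, 1, 1, 1, 0, 0, 1, 1]
doc_b = [1, 1, 1, 0, 1, 1, 1, 1, 0, 1, 1, 1]


def dot_product(u, v):
    """Return the sum of element-wise products of two equal-length vectors."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    return sum(x * y for x, y in zip(u, v))


# For 0/1 vectors, the row sums count the 4-grams in each paragraph,
# and the dot product counts the 4-grams the two paragraphs share.
shared = dot_product(doc_a, doc_b)
print("Doc A sum:", sum(doc_a))
print("Doc B sum:", sum(doc_b))
print("A * B:", shared)
```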
SuccessiveReuse = Windows with Convolution Score >= 3
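|   | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 |
|---|---|---|---|---|---|---|---|---|
| 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 3 | 0 | 0 | **1** | **0** | **0** | **0** | 0 | 0 |
| 4 | 0 | 0 | **0** | **0** | **0** | **0** | 0 | 0 |
| 5 | 0 | 0 | **0** | **0** | **1** | **0** | 0 | 0 |
| 6 | 0 | 0 | **0** | **1** | **0** | **1** | 0 | 0 |
| 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 9 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |

X

|   |   |   |   |
|---|---|---|---|
| 1 | 0 | 0 | 0 |
| 0 | 1 | 0 | 0 |
| 0 | 0 | 1 | 0 |
| 0 | 0 | 0 | 1 |

=

|   |   |   |   |
|---|---|---|---|
| 1 | 0 | 0 | 0 |
| 0 | 0 | 0 | 0 |
| 0 | 0 | 1 | 0 |
| 0 | 0 | 0 | 1 |

= 3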
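The window scoring above can be sketched in plain Python. This is an illustration under stated assumptions, not the presenter's implementation: the grid values, the 4x4 diagonal kernel, and the threshold of 3 come from the slides, while the function names and the decision to slide the window over every position are hypothetical. The kernel is symmetric, so multiplying it element-wise against each window gives the same score as a true (flipped-kernel) convolution.

```python
# Match matrix copied from the grid above; rows are labelled 2-9, columns 5-12.
ROW_LABELS = list(range(2, 10))
COL_LABELS = list(range(5, 13))
MATCHES = [
    [0, 0, 0, 0, 0, 0, 0, 0],
    [0, 0, 1, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 1, 0, 0, 0],
    [0, 0, 0, 1, 0, 1, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0],
]

KERNEL_SIZE = 4  # side length of the diagonal kernel shown above
THRESHOLD = 3    # minimum convolution score that counts as successive reuse


def identity_kernel(n):
    """Build an n x n kernel with 1s on the main diagonal and 0s elsewhere."""
    return [[1 if i == j else 0 for j in range(n)] for i in range(n)]


def window_score(matrix, top, left, kernel):
    """Multiply the kernel element-wise against one window and sum the result."""
    size = len(kernel)
    return sum(
        matrix[top + i][left + j] * kernel[i][j]
        for i in range(size)
        for j in range(size)
    )


def successive_reuse(matrix, kernel, threshold):
    """Return (top, left, score) for every window scoring at least threshold."""
    size = len(kernel)
    hits = []
    for top in range(len(matrix) - size + 1):
        for left in range(len(matrix[0]) - size + 1):
            score = window_score(matrix, top, left, kernel)
            if score >= threshold:
                hits.append((top, left, score))
    return hits


hits = successive_reuse(MATCHES, identity_kernel(KERNEL_SIZE), THRESHOLD)
# Translate zero-based offsets back to the row and column labels of the grid.
labelled = [(ROW_LABELS[t], COL_LABELS[l], s) for t, l, s in hits]
print(labelled)
```

The diagonal kernel rewards matches that advance one step in both sequences at once, so the off-diagonal match in row 6, column 8 contributes nothing to the highlighted window's score.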