Do I understand correctly that, after the successful email conversation with Dan, @rjnw is now un-stuck on both the NB and the LDA benchmarks?
I think LDA was working, not sure about NB
There are two email threads; we only worked on LDA. He replied to the NB thread too, but I don’t know what’s going on there.
On LDA, I believe you are about to produce plots. On NB, I think you should apply the same dtype debugging as LDA to NB (whether 1D or 2D), and see if AugurV2 stops classifying every test-set document as newsgroup # 19.
(reminder: MALLET for LDA is also worth comparing to)
hmm I will see if I can do MALLET; right now I am working on Augur and Hakaru.
Is there a way to normalize the likelihood?
the numbers come out hugely negative in log space, like -30613812.14543229
right now I am just trying to subtract the smallest value across both augur and hakaru
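Something like this is what I mean by subtracting the smallest value (a minimal sketch; the arrays here are made-up stand-ins for the real log-likelihood traces):

```python
import numpy as np

# Hypothetical log-likelihood traces from the two systems (made-up values;
# the real ones come from the Augur and Hakaru runs).
augur_ll = np.array([-30613900.0, -30613812.1, -30613700.5])
hakaru_ll = np.array([-30613950.0, -30613850.3, -30613780.9])

# Shift both traces by the SAME baseline: the smallest value across both.
# This changes only the axis labels, not the shape of either curve,
# so the comparison between the two systems is unaffected.
baseline = min(augur_ll.min(), hakaru_ll.min())
augur_shifted = augur_ll - baseline
hakaru_shifted = hakaru_ll - baseline
```

The key point is to use one shared baseline for both systems; shifting each trace by its own minimum would distort the comparison.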
Why bother normalizing? Just plot. There’s no need to show 0 on the vertical axis.
hmm okay
The fact that the numbers are so small just means it is extremely unlikely for a random text generator to generate exactly the 20-newsgroups corpus. https://commons.wikimedia.org/wiki/File:Chimpanzee_seated_at_typewriter.jpg
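A back-of-envelope check shows why log-likelihoods in the tens of millions are expected (the vocabulary and token counts below are made-up round numbers, not the actual 20-newsgroups statistics):

```python
import math

# A "random text generator" that picks tokens uniformly from a vocabulary
# of V words assigns each token probability 1/V, so over N tokens the
# log-likelihood is N * log(1/V) -- hugely negative.
V = 50_000      # hypothetical vocabulary size
N = 2_000_000   # hypothetical total token count for the corpus

log_lik = N * math.log(1.0 / V)
# On the order of -2e7, i.e. the same ballpark as -30613812.
```

So a value around -3e7 is not a bug; it is simply what the log-probability of an entire corpus looks like.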
I ran it twice, but the plot shows only one trial.
I am going to compare the LLVM backend with the Haskell backend to see if the compilation is correct.
Oh, the x-axis is time in seconds and the y-axis is log-likelihood.
I guess by “compare” you mean comparing the probabilities computed for a single update, because a whole sweep would take the Haskell backend too long.
How many sweeps does that llvm curve represent?