Comments on: Clade – a freely available, open source taxonomy and autoclassification tool
http://www.flax.co.uk/blog/2012/06/12/clade-a-freely-available-open-source-taxonomy-and-autoclassification-tool/
The Open Source Search Specialists
Tue, 12 Feb 2019 14:44:32 +0000

By: BHANUMATHI H M http://www.flax.co.uk/blog/2012/06/12/clade-a-freely-available-open-source-taxonomy-and-autoclassification-tool/#comment-38966 Fri, 08 Jul 2016 09:55:51 +0000 http://www.flax.co.uk/blog/?p=791#comment-38966 Hi,
I would like to understand the Clade tool. I am working on a taxonomy classification project and have followed all the steps described at https://github.com/flaxsearch/clade

However, I do not understand how the Stanford NER package is used in Clade. I went through clasfy.py. I am particularly keen to understand how Stanford NER is used in taxonomy classification. From the Stanford NER documentation and the installed package, it appears to tag the words of a sentence as location, person or organisation. How is this information used in taxonomy classification?

Kindly let me know.

By: charlie http://www.flax.co.uk/blog/2012/06/12/clade-a-freely-available-open-source-taxonomy-and-autoclassification-tool/#comment-24664 Mon, 08 Dec 2014 10:26:58 +0000 http://www.flax.co.uk/blog/?p=791#comment-24664 Hi Kanza, I think you’d be best asking the Stanford team as we’re not the authors of this software.

By: kanza http://www.flax.co.uk/blog/2012/06/12/clade-a-freely-available-open-source-taxonomy-and-autoclassification-tool/#comment-24663 Fri, 05 Dec 2014 16:27:50 +0000 http://www.flax.co.uk/blog/?p=791#comment-24663 I need to extract person names, locations and organization names using Stanford NER, but I am getting the same error.
My code is:
private CRFClassifier classifier;
private String file = "english.all.3class.distsim.crf.ser.gz";
CRFClassifier classifier = CRFClassifier.getClassifier(file);
and I get the error on this line:
CRFClassifier classifier = CRFClassifier.getClassifier(file);
The error is:
ClassCastException: java.util.ArrayList cannot be cast to [Ledu.stanford.nlp.util.Index;

Could someone please help me?

By: charlie http://www.flax.co.uk/blog/2012/06/12/clade-a-freely-available-open-source-taxonomy-and-autoclassification-tool/#comment-24662 Tue, 11 Nov 2014 14:10:01 +0000 http://www.flax.co.uk/blog/?p=791#comment-24662 Try the latest Stanford NLP – as other posters have commented that seems to fix the issue.

By: Dikshant Shahi http://www.flax.co.uk/blog/2012/06/12/clade-a-freely-available-open-source-taxonomy-and-autoclassification-tool/#comment-24661 Thu, 06 Nov 2014 07:16:07 +0000 http://www.flax.co.uk/blog/?p=791#comment-24661 Cannot get it to work 🙁

Got the following error:
Loading classifier from /home/dikchant/applications/clade-master/stanford-ner-2011-09-14/classifiers/all.3class.distsim.crf.ser.gz ... Exception in thread "main" java.io.FileNotFoundException: classifiers/all.3class.distsim.crf.ser.gz (No such file or directory)
at java.io.FileInputStream.open(Native Method)
at java.io.FileInputStream.<init>(FileInputStream.java:146)

After downloading the version Pengchu suggested, I get the following error:
dikchant@dikchant-Latitude-E5440:~/applications/clade-master/stanford-ner-2011-09-14$ ./server.sh
Loading classifier from /home/dikchant/applications/clade-master/stanford-ner-2011-09-14/classifiers/english.all.3class.distsim.crf.ser.gz ... Exception in thread "main" java.lang.ClassCastException: java.util.ArrayList cannot be cast to [Ledu.stanford.nlp.util.Index;
at edu.stanford.nlp.ie.crf.CRFClassifier.loadClassifier(CRFClassifier.java:2164)
at edu.stanford.nlp.ie.AbstractSequenceClassifier.loadClassifier(AbstractSequenceClassifier.java:1249)
at edu.stanford.nlp.ie.AbstractSequenceClassifier.loadClassifier(AbstractSequenceClassifier.java:1366)
at edu.stanford.nlp.ie.AbstractSequenceClassifier.loadClassifier(AbstractSequenceClassifier.java:1309)
at edu.stanford.nlp.ie.crf.CRFClassifier.getClassifier(CRFClassifier.java:2296)
at edu.stanford.nlp.ie.NERServer.main(NERServer.java:324)

By: Pengchu http://www.flax.co.uk/blog/2012/06/12/clade-a-freely-available-open-source-taxonomy-and-autoclassification-tool/#comment-24660 Fri, 17 Oct 2014 21:11:26 +0000 http://www.flax.co.uk/blog/?p=791#comment-24660 I downloaded this version from the Stanford NLP site and replaced the current one. It seems to work well.

By: Pengchu http://www.flax.co.uk/blog/2012/06/12/clade-a-freely-available-open-source-taxonomy-and-autoclassification-tool/#comment-24659 Fri, 17 Oct 2014 19:44:23 +0000 http://www.flax.co.uk/blog/?p=791#comment-24659 All:
I get an exception when I run ./server.sh in the stanford-ner-2011-09-14 directory:
s952275:stanford-ner-2011-09-14 pzhang$ ./server.sh &
[1] 15115
s952275:stanford-ner-2011-09-14 pzhang$ Loading classifier from /Users/pzhang/clade-master/stanford-ner-2011-09-14/classifiers/all.3class.distsim.crf.ser.gz ... Exception in thread "main" java.io.FileNotFoundException: classifiers/all.3class.distsim.crf.ser.gz (No such file or directory)
at java.io.FileInputStream.open(Native Method)
at java.io.FileInputStream.<init>(FileInputStream.java:138)
at edu.stanford.nlp.ie.AbstractSequenceClassifier.loadClassifier(AbstractSequenceClassifier.java:1362)
at edu.stanford.nlp.ie.AbstractSequenceClassifier.loadClassifier(AbstractSequenceClassifier.java:1309)
at edu.stanford.nlp.ie.crf.CRFClassifier.getClassifier(CRFClassifier.java:2296)
at edu.stanford.nlp.ie.NERServer.main(NERServer.java:324)

[1]+ Exit 1 ./server.sh
Any help would be appreciated.

Pengchu
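
A note on the FileNotFoundException above: the trace shows the server looking for classifiers/all.3class.distsim.crf.ser.gz, while another trace in this thread shows a download that contains english.all.3class.distsim.crf.ser.gz instead. Below is a minimal Python sketch for checking which model files are actually present before launching server.sh. The helper names `find_classifier_models` and `model_is_present` are hypothetical and are not part of Clade or Stanford NER.

```python
import os


def find_classifier_models(ner_dir, suffix=".crf.ser.gz"):
    """Return sorted names of model files under <ner_dir>/classifiers.

    Hypothetical helper for illustration; not part of Clade or Stanford NER.
    """
    classifiers_dir = os.path.join(ner_dir, "classifiers")
    if not os.path.isdir(classifiers_dir):
        return []
    return sorted(
        name for name in os.listdir(classifiers_dir) if name.endswith(suffix)
    )


def model_is_present(ner_dir, model_name):
    """Return True if the exact model file named in the error exists."""
    return os.path.isfile(os.path.join(ner_dir, "classifiers", model_name))


if __name__ == "__main__":
    ner_dir = "stanford-ner-2011-09-14"  # directory name taken from the thread
    wanted = "all.3class.distsim.crf.ser.gz"  # name from the FileNotFoundException
    if not model_is_present(ner_dir, wanted):
        print("Missing:", wanted)
        print("Available:", find_classifier_models(ner_dir))
```

If the wanted name is missing but a similarly named model is listed, the model path used by server.sh and the downloaded files probably disagree. This check says nothing about the separate ClassCastException reported elsewhere in the thread.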

By: charlie http://www.flax.co.uk/blog/2012/06/12/clade-a-freely-available-open-source-taxonomy-and-autoclassification-tool/#comment-24658 Mon, 23 Jun 2014 07:56:49 +0000 http://www.flax.co.uk/blog/?p=791#comment-24658 Glad to hear it. Yes, it would be perfectly feasible to index PDFs; I suspect we’d use Apache Tika.

By: Mark Evans http://www.flax.co.uk/blog/2012/06/12/clade-a-freely-available-open-source-taxonomy-and-autoclassification-tool/#comment-24657 Fri, 20 Jun 2014 20:37:38 +0000 http://www.flax.co.uk/blog/?p=791#comment-24657 Charlie,

Thanks for the response. It worked very well up to about 200 results.

Great to hear that you are working on a new release. I will check back frequently. In our case, our source docs are PDFs. I had to extract the text layer before I could index them. Having a module that could pick up a list of PDF files would be a great addition.

Thanks again
Mark

By: charlie http://www.flax.co.uk/blog/2012/06/12/clade-a-freely-available-open-source-taxonomy-and-autoclassification-tool/#comment-24656 Fri, 20 Jun 2014 07:58:19 +0000 http://www.flax.co.uk/blog/?p=791#comment-24656 Hi Mark,

If you change the number set for ‘rows’ in taxonomy.py line 331:
results = query.field_limit(score=True).paginate(rows=10).execute() #FIXME: configurable rows
and re-run the system you should get more documents shown. Bear in mind this may have unintended consequences! We’re working on a new release of Clade with a few improvements, watch this space…

Charlie
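
The "#FIXME: configurable rows" note in the quoted line suggests reading the row count from a setting rather than hard-coding 10. Here is a minimal sketch of one way to do that, assuming an environment variable. The variable name `CLADE_RESULT_ROWS` and the helper `result_rows` are hypothetical and are not part of Clade.

```python
import os

DEFAULT_ROWS = 10  # the value hard-coded in the quoted line


def result_rows(environ=None, default=DEFAULT_ROWS):
    """Return the number of rows to request, read from CLADE_RESULT_ROWS.

    CLADE_RESULT_ROWS is a hypothetical setting name used for illustration.
    Falls back to `default` when the variable is unset, not an integer,
    or not positive.
    """
    environ = os.environ if environ is None else environ
    raw = environ.get("CLADE_RESULT_ROWS")
    if raw is None:
        return default
    try:
        rows = int(raw)
    except ValueError:
        return default
    return rows if rows > 0 else default


# The quoted line could then become (untested against Clade itself):
# results = query.field_limit(score=True).paginate(rows=result_rows()).execute()
```

Falling back to the default on bad input keeps a mistyped setting from changing the existing behaviour. The caveat about unintended consequences of larger row counts still applies.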
