Hello,
I'm working on a paper that compares different word segmentation algorithms, and wordseg has been extremely helpful. The wordseg documentation says that the DPSEG algorithm has a bug in it and "is not fully functional at present". I can't find any information about what the bug is/was and whether it has been fixed. I noticed in the source code that there's a function called _dpseg_bugfix to correct an issue with certain types of input. Is this the bug? I'm just hoping to confirm that DPSEG works properly before we report any results.
Thanks!
Hello,
I'm working on a paper that compares different word segmentation algorithms, and wordseg has been extremely helpful. The wordseg documentation says that the DPSEG algorithm has a bug in it and "is not fully functional at present". I can't find any information about what the bug is/was and whether it has been fixed. I noticed in the source code that there's a function called _dpseg_bugfix to correct an issue with certain types of input. Is this the bug? I'm just hoping to confirm that DPSEG works properly before we report any results.
Thanks!