  1. Jan 22, 2011
  2. Dec 29, 2010
  3. Dec 27, 2010
    • Indexer v3 Rewrite part one (unstable) · 00803e56
      Tom N Harris authored
      The indexer functions have been converted to a class interface.
      Use the Doku_Indexer class to access the indexer with these public methods:
        addPageWords
        addMetaKeys
        deletePage
        tokenizer
        lookup
        lookupKey
        getPages
        histogram
      These functions are provided for general use:
        idx_get_version
        idx_get_indexer
        idx_get_stopwords
        idx_addPage
        idx_lookup
        idx_tokenizer
      These functions are still available, but are deprecated:
        idx_getIndex
        idx_indexLengths
      All other old idx_ functions are unsupported and have been removed.
  4. Nov 20, 2010
  5. Nov 18, 2010
  6. Nov 17, 2010
  7. Nov 16, 2010
  8. Nov 15, 2010
    • Indexer improvement: regex instead of arrays for lines · 4753bcc0
      Michael Hamann authored
      Previously, updating a single index line meant splitting the line into
      an array, looping over it to remove the old entry, and then adding the
      new one. Tests have shown that doing this with a regex is much faster:
      the pattern is very simple to match, while looping over an array is
      comparatively slow. As the update function is called for every word in
      a page, the impact of this change is significant.
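As a rough illustration of the two approaches described in this commit message, here is a minimal Python sketch (the actual indexer is PHP). The line format `key*count:key*count` and both function names are assumptions made for the example, not taken from the commit. The sketch only shows that the two variants produce the same result; it makes no claim about relative speed in Python.

```python
import re


def update_line_loop(line, key, count):
    """Array variant: split the line, drop the old entry, append the new one."""
    parts = [p for p in line.split(":") if p]
    kept = [p for p in parts if p.split("*", 1)[0] != key]
    if count:
        kept.append(f"{key}*{count}")
    return ":".join(kept)


def update_line_regex(line, key, count):
    """Regex variant: remove the old entry with a single substitution."""
    # Hypothetical pattern: the entry must start at the line start or after ':'
    pattern = r"(^|:)" + re.escape(key) + r"\*\d*"
    line = re.sub(pattern, "", line).strip(":")
    if count:
        return f"{line}:{key}*{count}" if line else f"{key}*{count}"
    return line
```

A count of 0 removes the entry in both variants, and a key such as `1` does not match the entry for `11` because the pattern is anchored to the start of an entry.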
    • Indexer improvement: Only write the words index when needed · e5e50383
      Michael Hamann authored
      This adds a simple boolean variable that tracks whether new words have
      been added. When a page is edited, in many cases all of its words have
      already been used somewhere else, or just one or two words are new.
      Until this change, every word index that was read was also written
      back; now only the changed ones are written. The overhead of the new
      boolean variable should be low.
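The idea of tracking changes with a boolean can be sketched in Python as a "dirty flag". The class `WordIndex`, its methods, and the `write` callback are hypothetical names for this illustration; the real indexer is written in PHP and structured differently.

```python
class WordIndex:
    """In-memory word list that is only written back when it has changed."""

    def __init__(self, words):
        self.words = list(words)
        self.dirty = False  # becomes True once a new word is added

    def id_for(self, word):
        """Return the position of a word, appending it if it is new."""
        try:
            return self.words.index(word)
        except ValueError:
            self.words.append(word)
            self.dirty = True
            return len(self.words) - 1

    def save(self, write):
        """Call write() with the serialized index only if something changed."""
        if not self.dirty:
            return False
        write("\n".join(self.words) + "\n")
        self.dirty = False
        return True
```

Looking up only existing words leaves the flag unset, so `save` skips the write entirely; adding a single new word triggers exactly one write.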
    • Indexer improvement: replace _freadline by fgets · 037b5573
      Michael Hamann authored
      Since PHP 4.3.0, fgets reads a whole line regardless of its length when
      no length is given, so the loop in _freadline isn't needed. This
      increases the speed significantly, as _freadline was called very often.
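The difference between a chunked read loop and a single whole-line read can be shown with a Python analogy. The body of `_freadline` does not appear in this log, so `read_line_chunked` below is only an assumed reconstruction of the general pattern, and Python's `readline` stands in for PHP's `fgets`.

```python
import io


def read_line_chunked(stream, chunk=8):
    """Assemble one line from fixed-size pieces (the loop made unnecessary)."""
    parts = []
    while True:
        piece = stream.readline(chunk)  # reads at most `chunk` characters
        parts.append(piece)
        if not piece or piece.endswith("\n"):
            break
    return "".join(parts)


def read_line_whole(stream):
    """Read one line of any length with a single call."""
    return stream.readline()


data = "a" * 20 + "\nnext\n"
chunked = read_line_chunked(io.StringIO(data))
whole = read_line_whole(io.StringIO(data))
```

Both functions return the same line; the second needs one call where the first needs several for long lines.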
    • Indexer speed improvement: joined array vs. single lines · 06af2d03
      Michael Hamann authored
      In my benchmarks of the indexer, it is faster to first join the array
      of all index entries and write them back in one go than to write every
      entry individually. This might increase memory usage, but I couldn't
      see a significant increase, and this function is only used for the
      small index files, not for the large pagewords index.
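A small Python sketch of the two write strategies compared in this commit message follows. The function names are hypothetical, and the sketch only demonstrates that both produce identical output; the speed difference reported in the commit applies to the PHP indexer and is not measured here.

```python
import io


def write_each(stream, entries):
    """Write every index entry with its own write call."""
    for entry in entries:
        stream.write(entry + "\n")


def write_joined(stream, entries):
    """Join all entries in memory first, then write them in one call."""
    if entries:
        stream.write("\n".join(entries) + "\n")


entries = ["start", "wiki:syntax", "playground"]
one_by_one = io.StringIO()
joined = io.StringIO()
write_each(one_by_one, entries)
write_joined(joined, entries)
```

The joined variant holds the whole file content in memory at once, which is the memory trade-off the commit message mentions.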
    • tokenizer was returning prematurely · 5bcab0c4
      Tom N Harris authored
  9. Nov 14, 2010
  10. Nov 13, 2010
  11. Nov 12, 2010
  12. Nov 11, 2010
  13. Nov 07, 2010