Performance of strings.Trim() vs strings.TrimFunc()

John Potocny

unread,

Mar 21, 2015, 2:45:20 PM3/21/15

to golan...@googlegroups.com

Hi everyone,

I spent some time this week refactoring a piece of code to extract part of a string from a larger block, and in the process I discovered that the strings.Trim() function was actually fairly slow, and was allocating memory when I called it. This was fairly surprising to me, since I had always figured the implementation of Trim would be pretty efficient.

I took a look at the implementation to figure this out - here it is below as a reference:


func makeCutsetFunc(cutset string) func(rune) bool {
    return func(r rune) bool { return IndexRune(cutset, r) >= 0 }
}
 

 // Trim returns a slice of the string s with all leading and
 // trailing Unicode code points contained in cutset removed.
 func Trim(s string, cutset string) string {
     if s == "" || cutset == "" {
         return s
     }
     return TrimFunc(s, makeCutsetFunc(cutset))
 }

Now it makes perfect sense to me - the plain strings.Trim() functions work by calling their corresponding strings.TrimFunc() implementation, with the provided cutset argument used in a closure that is used as f(c). This is of course where the allocation comes from, but it seems pretty inefficient to me, and indeed a simple benchmark shows that strings.TrimFunc() can be used in place of strings.Trim() for a fairly substantial performance boost (if you know the cutset).

I played around with the implementation of strings.Trim() and was able to mock up a version that avoids allocation fairly easily - it basically substitutes the current implementation of Trim with that of strings.TrimFunc(), using a hardcoded f(c). This duplicates a bit of code of course, but it also provides a decent speedup (about 20%, based on some simple benchmarks I did).

I'm curious whether anyone would find this useful - do people have any use for a faster Trim() implementation, or is everyone who needs performance already just using TrimFunc()? If people think that the performance boost might be worth the code duplication involved, I'm happy to polish my implementation and submit a CL. Let me know what you guys think!

David DENG

unread,

Mar 21, 2015, 5:47:25 PM3/21/15

to golan...@googlegroups.com

I guess(not testing just from common sense) the following code could avoid allocation(on heap):

func Trim(s string, cutset string) string {
if s == "" || cutset == "" {
return s
}

return TrimFunc(s, func(r rune) bool { return IndexRune(cutset, r) >= 0 })
}

It only expands makeCutsetFunc into Trim. I think the compiler can analyze and know the second parameter of TrimFun doesn't escape and makes the closure allocated on stack.

John Potocny

unread,

Mar 21, 2015, 5:56:17 PM3/21/15

to golan...@googlegroups.com

Tested it, totally works! My benchmarks show that the version I wrote is a little faster, although that is probably because my version has a smaller call stack :)

Brad Fitzpatrick

unread,

Mar 21, 2015, 8:51:20 PM3/21/15

to John Potocny, golang-nuts

Send in a change + additional benchmark!

--
You received this message because you are subscribed to the Google Groups "golang-nuts" group.
To unsubscribe from this group and stop receiving emails from it, send an email to golang-nuts...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply all

Reply to author

Forward