Inconsistent split Behavior in Python

Saturday, November 05, 2011.

Here’s a futile but cathartic bug report I filed against Python recently.

In Python, string.split and re.split both take an optional maxsplit argument that limits the number of splits performed. This is unlike Perl’s split builtin, whose limit caps the number of pieces returned. But it makes sense, I guess, and consistency between the two languages is not something I’d necessarily expect.
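To make the splits-versus-pieces distinction concrete, here is a minimal sketch. It uses the str.split method (the method form of string.split), and the sample string is made up for illustration:

```python
import re

line = "a,b,c,d"  # sample data, chosen only for illustration

# The limit counts splits, so a limit of 2 yields at most 3 pieces.
by_str = line.split(",", 2)
by_re = re.split(",", line, maxsplit=2)

print(by_str)  # ['a', 'b', 'c,d']
print(by_re)   # ['a', 'b', 'c,d']
```

With the same limit of 2, Perl's split(/,/, "a,b,c,d", 2) would stop at two pieces: "a" and "b,c,d".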

However, consistency within a language…a reasonable expectation, no?

The inconsistency lies in how string.split and re.split handle the edge cases of “do an unlimited number of splits” and “don’t do any splits.” The two agree that “unlimited splits” is the default. They don’t agree on which explicit maxsplit value means which: each function treats 0 and -1 in the opposite way from the other.

              maxsplit=0        maxsplit=-1
string.split  no splits         unlimited splits
re.split      unlimited splits  no splits
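The table can be checked with a short sketch like the one below. The sample string is made up, str.split stands in for string.split, and the re.split result for a negative maxsplit is what CPython does in practice rather than something I'd rely on the documentation to promise:

```python
import re

s = "a,b,c"  # sample data, chosen only for illustration

# str.split: 0 means "no splits", -1 means "unlimited" (its default).
str_zero = s.split(",", 0)
str_neg = s.split(",", -1)

# re.split: 0 means "unlimited" (its default); -1 results in no splits.
re_zero = re.split(",", s, maxsplit=0)
re_neg = re.split(",", s, maxsplit=-1)

print(str_zero)  # ['a,b,c']
print(str_neg)   # ['a', 'b', 'c']
print(re_zero)   # ['a', 'b', 'c']
print(re_neg)    # ['a,b,c']
```

Code that passes a computed maxsplit to both functions needs to special-case these two values, since neither 0 nor -1 means the same thing in both.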

I think string.split is doing the sensible thing here.

Of course, the “bug” has zero chance of being fixed at this point. I pretty much just filed it to create a search result for others similarly bitten, annoyed, or both.

Posted by Alan on Saturday, November 05, 2011.
