How to Use and Setup Emacs's whitespace-mode

Advertise Here

, , …,

This page tells you how to setup emacs's whitespace-mode (bundled with emacs 23), and how to use it.

whitespace-mode renders {spaces, tabs, newlines} characters with a visible glyph. This feature is useful for working with “comma/tab separated values” (CSV, TSV) that's commonly used format for importing/exporting address books or spreadsheets. It's also important in whitespace-significant langs such as Python.

To use it, call whitespace-mode. (【Alt+x】) The command will toggle it on and off, for current file. Call global-whitespace-mode to toggle it globally for current emacs session.

There is also whitespace-newline-mode and global-whitespace-newline-mode. They only show newline chars.

Setting Up whitespace-mode

emacs whitespace
The default rendering of whitespace-mode. Different placement and mix of whitespaces are rendered with different colors. Also, long lines are colored dark purple. (download whitespace_sample_file.txt)
emacs whitespace clean
A clean setup for whitespace-mode.

How to reduce colors in whitespace-mode?

Put the following in your emacs init file:

;; make whitespace-mode use just basic coloring
(setq whitespace-style (quote (spaces tabs newline space-mark tab-mark newline-mark)))

How to make whitespace-mode use the pilcrow sign “¶” for newline instead of the dollar sign?

Put the following in your emacs init file:

;; make whitespace-mode use “¶” for newline and “▷” for tab.
;; together with the rest of its defaults
(setq whitespace-display-mappings
 '(
   (space-mark 32 [183] [46]) ; normal space, ·
   (space-mark 160 [164] [95])
   (space-mark 2208 [2212] [95])
   (space-mark 2336 [2340] [95])
   (space-mark 3616 [3620] [95])
   (space-mark 3872 [3876] [95])
   (newline-mark 10 [182 10]) ; newlne, ¶
   (tab-mark 9 [9655 9] [92 9]) ; tab, ▷
))

In the above, the numbers are Unicode char code in decimal. Depending on your choice of font, some glyphs may not show up or correctly. If so, you can try the following glyphs.

GlyphUnicode
Code Point
(Decimal)
Unicode Name
·183MIDDLE DOT
182PILCROW SIGN
8629DOWNWARDS ARROW WITH CORNER LEFTWARDS
8617LEFTWARDS ARROW WITH HOOK
9166RETURN SYMBOL
9655WHITE RIGHT POINTING TRIANGLE
9654BLACK RIGHT-POINTING TRIANGLE
8594RIGHTWARDS ARROW
8614RIGHTWARDS ARROW FROM BAR
8677RIGHTWARDS ARROW TO BAR
8680RIGHTWARDS WHITE ARROW

See also: Best Fonts for UnicodeComputing Symbols in Unicode.

If you are wondering what are those other decimal numbers in the default whitespace settings, here they are:

DecimalGlyphName or UNICODE NAME
10line feed
32space
46.full stop
95_underscore LOW LINE
160 NO-BREAK SPACE
164¤Symbol, Currency
2208invalid Unicode decimal
2212invalid Unicode decimal
2336DEVANAGARI LETTER TTHA
2340DEVANAGARI LETTER TA
3616THAI CHARACTER PHO SAMPHAO
3620THAI CHARACTER RU
3872TIBETAN DIGIT ZERO
3876TIBETAN DIGIT FOUR

I have no idea why those {Indian, Thai, Tibetan} characters are doing there.

Common Questions for Editing Whitespaces

How to delete all whitespaces?

Command NameArea of ActionAction
whitespace-cleanuptext selectiondelete whitespace in a smart way
delete-trailing-whitespacebufferdelete all trailing whitespaces
delete-whitespace-rectangleselection as rectangledelete whitespace in text selection start/end points as a rectangle

For more fine control of deleting whitespaces, you can use use query-replace, or query-replace-regexp. (➲ Find & Replace with Emacs)

How to insert Tab or Newline char?

The following methods works everywhere (also works in minibuffer).

To insert a literal tab char, press 【Ctrl+q Tab ⇆】.

To insert a newline char, type 【Ctrl+q Ctrl+j】.

You need to use the above method to insert these characters, because for example in minibuffer, pressing Tab ⇆ does name completion and pressing Enter finishes the prompt. In most programing language modes, pressing Enter or Tab ⇆ also does some auto indenting and formatting.

For detail, see: Emacs's Key Notations Explained (/r, ^M, C-m, RET, <return>, M-, meta).

blog comments powered by Disqus