Strip Html

// Used when we want to completely remove HTML code and not encode it with XML entities.
public static string StripHtml(this string input)
    // Will this simple expression replace all tags???
    var tagsExpression = new Regex(@"</?.+?>");
    return tagsExpression.Replace(input, " ");
var htmlText = "<p>Here is some text. <span class="bold">This is bold.</span> Talk to you later.</p>;
var cleanString = htmlText.StripHtml();


Removes any HTML or XML tags from a string. Super simple, but I didn't see anything here like it. I've created similar methods in the past to take into account things like <script> blocks, but I'm not worrying about that here.


Double click on the code to select all.