Regular expression for validating url in javascript

Something like this (in Java Script): If you really want to know how, or have some uncertainties, just write below in comments. So, in order not to mix up some bracket that is not part of the URL, we are capturing the brackets before and after URL as well, and all you have to do after matching is to check if the both brackets exist and remove them. Anything else, and this whole exercise is a sham, because people will just write whatever works for them, or how they like it, and sacrifice "making any sense" in favour of being short ( like I did ).The most Naïve regex I can come up with that matches (and captures) all your pasted examples so far is: I think the sane approach is to extract things that are likely to be URI's, then validate them with something stricter, I'm looking at working out how to use the browsers URI class to validate them =).It is supported by all major programming languages (PHP, Perl, Java Script, Java, . Here you can see URL anathomy (along with good SEO practices), or just search across the web for it.


)/gi; /* (^|\s) : ensure that we are not matching an url embeded in an other string (https? : the http or https schemes (optional) [\w-] (\.[\w-] ) \.?

as the regular expression and match valid URIs and not match all your examples on the »Not match« list. In order to be a rule that works as you intend, you actually do need to implement a full RFC compliant matcher, and a full RFC compliant matcher will "worry about not matching".

As long as you're going that route it's simply the question: What is the shortest regular expression that will not match any of the example strings but still catch all re = /(^|\s)((https? So, in terms of "permit not matching", you need to specify exactly which deviations from RFC are permissible.

Now here comes the fun part, once we’ve matched an URL, we can do with it whatever we like.


I just couldn’t get myself to write this right now, but I will if someone is interested. You should read the appropriate RFCs which contain relevant parts of the grammar.


  1. Pingback:

  2. eric   •  

    More than 2 million members have already placed their trust in our network and a few thousand are joining daily.

  3. eric   •  

    Evaluating the robust functionality of a full-blown website against the swipe-rightness of an app was too unfair.

  4. eric   •  

    Many sites like Chatroulette float around the Internet with hopes of becoming the next big thing. Our features are far better than any other random chat site on the Internet.

  5. eric   •  

    Также компания MSI не несет прямую или косвенную ответственность за редактирование и управление контентом. Интеллектуальная собственность Права на наименования и обладание на все логотипы, изображения, фотографии, файлы, иллюстрации, программное обеспечение и существенную информацию, отображаемую в службе MSI Online, и права на интеллектуальную собственность принадлежат компании MSI или ее провайдеру.

Leave a Reply

Your email address will not be published. Required fields are marked *

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>