Re: Strip emails from a document

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



If you are expecting the email address to always be the same but the first
part being different you can create a regular expression to match it that
way. Using a regular expression over all is going to be your best bet as
shiplu suggested.


On Sat, Jan 26, 2013 at 10:54 AM, shiplu <shiplu.net@xxxxxxxxx> wrote:

> I think you meant extract emails from document, right?
>
> I'd probably find `@` and iterate before and after unless I get posix
> punct, space, characters.  But it'll probably give some false matches. So
> its really hard to find 100% emails from an arbitrary text. This is because
> valid email can contain many different type of characters. According to RFC
> 822 space is a valid character in email. So finding all the valid emails is
> tough.
> In a *trivial situation* an email would be separated by space. So find @
> first. Then go back and front to find the first space. You'll get most
> common emails. Something like using this regex pattern
> [^[:space:]<@]+@[^[:space:]>]+ would suffice.
> But keep in mind, it'll work on trivial cases. Not on special cases.
> Regular expression can not be used on special cases. Here is full RFC-822
> compliant email matching regular expression
> http://ex-parrot.com/~pdw/Mail-RFC822-Address.html
>
>
> More information can be found on
>
> http://stackoverflow.com/questions/201323/using-a-regular-expression-to-validate-an-email-address
>
>
> On Sat, Jan 26, 2013 at 10:24 PM, Tedd Sperling <tedd@xxxxxxxxxxxx> wrote:
>
> > Hi gang:
> >
> > I thought I had a function to strip emails from a document, but I can't
> > find it.
> >
> > So, before I start writing a common script, do any of you have a simple
> > script to do this?
> >
> > Here's an example of the problem:
> >
> > Before:
> >
> > "Will Alex" <alex13@xxxxxxxxxxx>;"Moita Zact" <zact3@xxxxxxxxxxx>;"Bob
> > Arms" <arms1@xxxxxxxxxxx>;"Meia Terms" <termsma@xxxxxxxxxxx>;
> >
> > After:
> >
> > alex13@xxxxxxxxxxx
> > zact3@xxxxxxxxxxx
> > arms1@xxxxxxxxxxx
> > termsma@xxxxxxxxxxx
> >
> > Cheers,
> >
> > tedd
> >
> >
> > _____________________
> > tedd@xxxxxxxxxxxx
> > http://sperling.com
> >
> >
> > --
> > PHP General Mailing List (http://www.php.net/)
> > To unsubscribe, visit: http://www.php.net/unsub.php
> >
> >
>
>
> --
> Shiplu.Mokadd.im
> ImgSign.com | A dynamic signature machine
> Innovation distinguishes between follower and leader
>

[Index of Archives]     [PHP Home]     [Apache Users]     [PHP on Windows]     [Kernel Newbies]     [PHP Install]     [PHP Classes]     [Pear]     [Postgresql]     [Postgresql PHP]     [PHP on Windows]     [PHP Database Programming]     [PHP SOAP]

  Powered by Linux