> Remi Lespinet <remi.lespinet@xxxxxxxxxxxxxxxxxxxxxxx> writes: > > > I've some more tests, maybe I should put them all in this post ? > > Yes, please post as much as you have. Ideally, this should be > automatically tested, but if you don't have time to write the automated > tests, at least having a track of what you did on the list archives can > help someone else to do it. It may not be easily readable without colors, so there are the scripts at the end. You can change the tested input by changing lines after the "cat >.tmplist" line in testall.sh. (There are two scripts testall.sh and testone.perl). Here are the tests results: Input: Split: M::A : Same : Yes ---------- Input: Jane Split: Jane M::A : Jane Same : Yes ---------- Input: jdoe@xxxxxxxxxxx Split: jdoe@xxxxxxxxxxx M::A : jdoe@xxxxxxxxxxx Same : Yes ---------- Input: <jdoe@xxxxxxxxxxx> Split: jdoe@xxxxxxxxxxx M::A : jdoe@xxxxxxxxxxx Same : Yes ---------- Input: Jane <jdoe@xxxxxxxxxxx> Split: Jane <jdoe@xxxxxxxxxxx> M::A : Jane <jdoe@xxxxxxxxxxx> Same : Yes ---------- Input: Jane Doe <jdoe@xxxxxxxxxxx> Split: Jane Doe <jdoe@xxxxxxxxxxx> M::A : Jane Doe <jdoe@xxxxxxxxxxx> Same : Yes ---------- Input: Jane\ Doe <jdoe@xxxxxxxxxxx> Split: "Jane\ Doe" <jdoe@xxxxxxxxxxx> M::A : "Jane \ Doe" <jdoe@xxxxxxxxxxx> Same : No ---------- Input: "Jane" <jdoe@xxxxxxxxxxx> Split: "Jane" <jdoe@xxxxxxxxxxx> M::A : "Jane" <jdoe@xxxxxxxxxxx> Same : Yes ---------- Input: "Doe, Jane" <jdoe@xxxxxxxxxxx> Split: "Doe, Jane" <jdoe@xxxxxxxxxxx> M::A : "Doe, Jane" <jdoe@xxxxxxxxxxx> Same : Yes ---------- Input: "Doe, Ja"ne <jdoe@xxxxxxxxxxx> Split: "Doe, Ja ne" <jdoe@xxxxxxxxxxx> M::A : "Doe, Ja" ne <jdoe@xxxxxxxxxxx> Same : No ---------- Input: "Doe, Katarina" Jane <jdoe@xxxxxxxxxxx> Split: "Doe, Katarina Jane" <jdoe@xxxxxxxxxxx> M::A : "Doe, Katarina" Jane <jdoe@xxxxxxxxxxx> Same : No ---------- Input: "Jane@:;\>.,()<Doe" <jdoe@xxxxxxxxxxx> Split: "Jane@:;\>.,()<Doe" <jdoe@xxxxxxxxxxx> M::A : "Jane@:;\>.,()<Doe" <jdoe@xxxxxxxxxxx> Same : Yes ---------- Input: Jane@:;\.,()<>Doe <jdoe@xxxxxxxxxxx> Split: Jane@: : "\." : Doe <jdoe@xxxxxxxxxxx> () M::A : Jane@: : \. : Doe <jdoe@xxxxxxxxxxx> () Same : No ---------- Input: Jane!#$%&'*+-/=?^_{|}~Doe' <jdoe@xxxxxxxxxxx> Split: Jane!#$%&'*+-/=?^_{|}~Doe' <jdoe@xxxxxxxxxxx> M::A : Jane!#$%&'*+-/=?^_{|}~Doe' <jdoe@xxxxxxxxxxx> Same : Yes ---------- Input: "<jdoe@xxxxxxxxxxx>" Split: "<jdoe@xxxxxxxxxxx>" M::A : "<jdoe@xxxxxxxxxxx>" Same : Yes ---------- Input: "Jane jdoe@xxxxxxxxxxx" Split: "Jane jdoe@xxxxxxxxxxx" M::A : "Jane jdoe@xxxxxxxxxxx" Same : Yes ---------- Input: Jane Doe <jdoe @ example.com > Split: Jane Doe <jdoe@xxxxxxxxxxx> M::A : Jane Doe <jdoe@xxxxxxxxxxx> Same : Yes ---------- Input: Jane Doe < jdoe@xxxxxxxxxxx > Split: Jane Doe <jdoe@xxxxxxxxxxx> M::A : Jane Doe <jdoe@xxxxxxxxxxx> Same : Yes ---------- Input: Jane @ Doe @ Jane @ Doe Split: Jane@Doe@Jane@Doe M::A : Jane@Doe@Jane@Doe Same : Yes ---------- Input: Jane jdoe@xxxxxxxxxxx Split: Janejdoe@xxxxxxxxxxx M::A : Jane : jdoe@xxxxxxxxxxx Same : No ---------- Input: <jdoe@xxxxxxxxxxx> Jane Doe Split: jdoe@example.comJaneDoe M::A : Jane Doe <jdoe@xxxxxxxxxxx> Same : No ---------- Input: Jane <jdoe@xxxxxxxxxxx> Doe Split: Jane <jdoe@example.comDoe> M::A : Jane Doe <jdoe@xxxxxxxxxxx> Same : No ---------- Input: "Jane, 'Doe'" <jdoe@xxxxxxxxxxx> Split: "Jane, 'Doe'" <jdoe@xxxxxxxxxxx> M::A : "Jane, 'Doe'" <jdoe@xxxxxxxxxxx> Same : Yes ---------- Input: 'Doe, "Jane' <jdoe@xxxxxxxxxxx> Split: 'Doe : " Jane' <jdoe@xxxxxxxxxxx> M::A : 'Doe : " Jane' <jdoe@xxxxxxxxxxx> Same : Yes ---------- Input: "Jane" "Do"e <jdoe@xxxxxxxxxxx> Split: "Jane" "Do" e <jdoe@xxxxxxxxxxx> M::A : "Jane" "Do" e <jdoe@xxxxxxxxxxx> Same : Yes ---------- Input: "Jane' Doe" <jdoe@xxxxxxxxxxx> Split: "Jane' Doe" <jdoe@xxxxxxxxxxx> M::A : "Jane' Doe" <jdoe@xxxxxxxxxxx> Same : Yes ---------- Input: "Jane Doe <jdoe@xxxxxxxxxxx>" <jdoe@xxxxxxxxxxx> Split: "Jane Doe <jdoe@xxxxxxxxxxx>" <jdoe@xxxxxxxxxxx> M::A : "Jane Doe <jdoe@xxxxxxxxxxx>" <jdoe@xxxxxxxxxxx> Same : Yes ---------- Input: "Jane\" Doe" <jdoe@xxxxxxxxxxx> Split: "Jane\" Doe" <jdoe@xxxxxxxxxxx> M::A : "Jane\" Doe" <jdoe@xxxxxxxxxxx> Same : Yes ---------- Input: Doe, jane <jdoe@xxxxxxxxxxx> Split: Doe : jane <jdoe@xxxxxxxxxxx> M::A : Doe : jane <jdoe@xxxxxxxxxxx> Same : Yes ---------- Input: "Jane Doe <jdoe@xxxxxxxxxxx> Split: " Jane Doe <jdoe@xxxxxxxxxxx> M::A : " Jane Doe <jdoe@xxxxxxxxxxx> Same : Yes ---------- Input: "Jane "Kat"a" ri"na" ",Doe" <jdoe@xxxxxxxxxxx> Split: "Jane Kat a ri na ,Doe" <jdoe@xxxxxxxxxxx> M::A : "Jane " Kat "a" ri "na" ",Doe" <jdoe@xxxxxxxxxxx> Same : No ---------- Input: Jane Doe Split: Jane Doe M::A : Jane : Doe Same : No ---------- Input: Jane "Doe <jdoe@xxxxxxxxxxx>" Split: "Jane Doe <jdoe@xxxxxxxxxxx>" M::A : Jane : "Doe <jdoe@xxxxxxxxxxx>" Same : No ---------- Input: \"Jane Doe <jdoe@xxxxxxxxxxx> Split: "\"Jane Doe" <jdoe@xxxxxxxxxxx> M::A : \ " Jane Doe <jdoe@xxxxxxxxxxx> Same : No ---------- Input: Jane\"\" Doe <jdoe@xxxxxxxxxxx> Split: "Jane\"\" Doe" <jdoe@xxxxxxxxxxx> M::A : Jane \ " \ " Doe <jdoe@xxxxxxxxxxx> Same : No ---------- Input: 'Jane 'Doe' <jdoe@xxxxxxxxxxx> Split: 'Jane 'Doe' <jdoe@xxxxxxxxxxx> M::A : 'Jane 'Doe' <jdoe@xxxxxxxxxxx> Same : Yes ---------- Input: 'Jane "Katarina\" \' Doe' <jdoe@xxxxxxxxxxx> Split: "'Jane Katarina\" \' Doe'" <jdoe@xxxxxxxxxxx> M::A : 'Jane " Katarina \ " \ ' Doe' <jdoe@xxxxxxxxxxx> Same : No ********************************************************************** * SCRIPTS PART * ********************************************************************** ---------------------------- testall.sh ---------------------------- #!/bin/sh cat >.tmplist <<EOF Jane jdoe@xxxxxxxxxxx <jdoe@xxxxxxxxxxx> Jane <jdoe@xxxxxxxxxxx> Jane Doe <jdoe@xxxxxxxxxxx> Jane\ Doe <jdoe@xxxxxxxxxxx> "Jane" <jdoe@xxxxxxxxxxx> "Doe, Jane" <jdoe@xxxxxxxxxxx> "Doe, Ja"ne <jdoe@xxxxxxxxxxx> "Doe, Katarina" Jane <jdoe@xxxxxxxxxxx> "Jane@:;\>.,()<Doe" <jdoe@xxxxxxxxxxx> Jane@:;\.,()<>Doe <jdoe@xxxxxxxxxxx> Jane!#$%&'*+-/=?^_{|}~Doe' <jdoe@xxxxxxxxxxx> "<jdoe@xxxxxxxxxxx>" "Jane jdoe@xxxxxxxxxxx" Jane Doe <jdoe @ example.com > Jane Doe < jdoe@xxxxxxxxxxx > Jane @ Doe @ Jane @ Doe Jane jdoe@xxxxxxxxxxx <jdoe@xxxxxxxxxxx> Jane Doe Jane <jdoe@xxxxxxxxxxx> Doe "Jane, 'Doe'" <jdoe@xxxxxxxxxxx> 'Doe, "Jane' <jdoe@xxxxxxxxxxx> "Jane" "Do"e <jdoe@xxxxxxxxxxx> "Jane' Doe" <jdoe@xxxxxxxxxxx> "Jane Doe <jdoe@xxxxxxxxxxx>" <jdoe@xxxxxxxxxxx> "Jane\" Doe" <jdoe@xxxxxxxxxxx> Doe, jane <jdoe@xxxxxxxxxxx> "Jane Doe <jdoe@xxxxxxxxxxx> "Jane "Kat"a" ri"na" ",Doe" <jdoe@xxxxxxxxxxx> Jane Doe Jane "Doe <jdoe@xxxxxxxxxxx>" \"Jane Doe <jdoe@xxxxxxxxxxx> Jane\"\" Doe <jdoe@xxxxxxxxxxx> 'Jane 'Doe' <jdoe@xxxxxxxxxxx> 'Jane "Katarina\" \' Doe' <jdoe@xxxxxxxxxxx> EOF cat .tmplist | while read -r line do echo "Input: $line" ./testone.perl "$line" echo ---------- done ---------------------------- testone.perl ---------------------------- #!/usr/bin/perl use strict; use warnings; use Term::ANSIColor; use Mail::Address; use Text::ParseWords; my $string = $ARGV[0]; sub split_addrs { my $re_comment = qr/\((?:[^)]*)\)/; my $re_quote = qr/"(?:[^\"\\]|\\.)*"/; my $re_word = qr/(?:[^]["\s()<>:;@\\,.]|\\.)+/; my $re_token = qr/(?:$re_quote|$re_word|$re_comment|\S)/; my @tokens = map { $_ =~ /\s*($re_token)\s*/g } @_; push @tokens, ","; my (@addr_list, @phrase, @address, @comment, @buffer) = (); foreach my $token (@tokens) { if ($token =~ /^[,;]$/) { if (@address) { push @address, @buffer; } else { push @phrase, @buffer; } my $str_phrase = join ' ', @phrase; my $str_address = join '', @address; my $str_comment = join ' ', @comment; if ($str_phrase =~ /[][()<>:;@\\,.\000-\037\177]/) { $str_phrase =~ s/(^|[^\\])"/$1/g; $str_phrase = qq["$str_phrase"]; } if ($str_address ne "" && $str_phrase ne "") { $str_address = qq[<$str_address>]; } my $str_mailbox = "$str_phrase $str_address $str_comment"; $str_mailbox =~ s/^\s*|\s*$//g; push @addr_list, $str_mailbox if ($str_mailbox); @phrase = @address = @comment = @buffer = (); } elsif ($token =~ /^\(/) { push @comment, $token; } elsif ($token eq "<") { push @phrase, (splice @address), (splice @buffer); } elsif ($token eq ">") { push @address, (splice @buffer); } elsif ($token eq "@") { push @address, (splice @buffer), "@"; } elsif ($token eq ".") { push @address, (splice @buffer), "."; } else { push @buffer, $token; } } return @addr_list; } sub old_split { quotewords('\s*,\s*', 1, $_[0]); } my @tab = split_addrs($string); my @ref = map { $_->format } Mail::Address->parse($string); # my @old = old_split($string); #can be printed to see the difference my $tabstring = join "\n", @tab; my $refstring = join "\n", @ref; my $same = ($tabstring eq $refstring); $tabstring =~ s/\n/\n : /g; $refstring =~ s/\n/\n : /g; print color 'bold yellow'; print "Split: ", "$tabstring", "\n"; print color 'bold blue'; print "M::A : ", "$refstring", "\n"; if ($same) { print color 'bold green'; print "Same : ", "Yes", "\n"; } else { print color 'bold red'; print "Same : ", "No", "\n"; } print color 'reset'; -- To unsubscribe from this list: send the line "unsubscribe git" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html