Text for links in 'Canned Messages' not fully duplicated

Post by **srumberg** » Sun Feb 10, 2013 9:48 am

Script URL:
Version of script: 2.4.2
Hosting company:
URL of phpinfo.php:
URL of session_test.php:
What terms did you try when SEARCHING for a solution: canned message link

Write your message below:

If my canned message includes a link to an article, like this...

click here: http://domain.com/support/knowledgebase.php?article=16

...then the message sent to the customer looks like this, MISSING THE ARTICLE NUMBER:

click here: http://www.domain.com/support/knowledgebase.php

The URL itself is still correct, but the anchor TEXT is incomplete.

Post by **Klemen** » Sun Feb 17, 2013 9:19 am

Correct, this is a problem with the 2.4.x URL parsing code.

A fix is ready and will be included in 2.5.

Post by **srumberg** » Sun Feb 17, 2013 5:21 pm

Thank you for confirming and I'm looking forward to the new version. I would like to implement the fix immediately. Would you please post it?

I think the issue may be in function hesk_msgToPlain() at line 758 in common.inc.php (in version 2.4.2):

$from = array('/\<a href="mailto\:([^"]*)"\>([^\<]*)\<\/a\>/i', '/\<a href="([^"]*)"( target="_blank")?\>([^\<]*)\<\/a\>/i');

Thank you so much
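
For readers following along, here is a minimal sketch of what the two patterns quoted above capture. The patterns are copied from the post; the replacement arrays are assumptions, since the thread does not show what hesk_msgToPlain() substitutes. The point is only that group 1 of the second pattern holds the full href, query string included, so this conversion step is unlikely to be where the article number gets dropped.

```php
<?php
// Patterns copied verbatim from the line quoted in the post above.
$from = array(
    '/\<a href="mailto\:([^"]*)"\>([^\<]*)\<\/a\>/i',
    '/\<a href="([^"]*)"( target="_blank")?\>([^\<]*)\<\/a\>/i'
);

// Sample input: the href is complete, the anchor text has lost its query string.
$html = 'click here: <a href="http://domain.com/support/knowledgebase.php?article=16">'
      . 'http://domain.com/support/knowledgebase.php</a>';

// ASSUMPTION: replacement strings are illustrative, not taken from HESK.
// Keeping the href (group 1) preserves the query string.
$withHref = preg_replace($from, array('$1', '$1'), $html);

// Keeping the anchor text (group 2 for mailto, group 3 otherwise) reproduces the truncation.
$withText = preg_replace($from, array('$2', '$3'), $html);

echo $withHref, "\n"; // click here: http://domain.com/support/knowledgebase.php?article=16
echo $withText, "\n"; // click here: http://domain.com/support/knowledgebase.php
```

Either way, the truncated anchor text already exists before these patterns run, which points at whatever generated the `<a>` tag.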

Post by **Klemen** » Mon Feb 18, 2013 9:49 am

Actually, it's the hesk_makeURL function inside common.inc.php that takes care of this.

This is what 2.5 will use. It should keep query strings in URLs and detect almost any link type, for example ftp://, sftp://, torrent://, file://, etc.

Code:

function hesk_makeURL($text)
{
	$rexScheme    = '[a-z\d+]*:/{2}';
	$rexDomain    = '(?:[-a-zA-Z0-9]{1,63}\.)+[a-zA-Z][-a-zA-Z0-9]{1,62}';
	$rexIp        = '(?:[1-9][0-9]{0,2}\.|0\.){3}(?:[1-9][0-9]{0,2}|0)';
	$rexPort      = '(:[0-9]{1,5})?';
	$rexPath      = '(/[!$-/0-9:;=@_\':;!a-zA-Z\x7f-\xff]*?)?';
	$rexQuery     = '(\?[!$-/0-9:;=@_\':;!a-zA-Z\x7f-\xff]+?)?';
	$rexFragment  = '(#[!$-/0-9:;=@_\':;!a-zA-Z\x7f-\xff]+?)?';
	$rexUsername  = '[^]\\\\\x00-\x20\"(),:-<>[\x7f-\xff]{1,64}';
	$rexPassword  = $rexUsername; // allow the same characters as in the username
	$rexUrl       = "($rexScheme)?(?:($rexUsername)(:$rexPassword)?@)?($rexDomain|$rexIp)($rexPort$rexPath$rexQuery$rexFragment)";
	$rexTrailPunct= "[)'?.!,;:]"; // valid URL characters which are not part of the URL if they appear at the very end
	$rexNonUrl    = "[^-_$+.!*'(),;/?:@=&a-zA-Z0-9]"; // characters that should never appear in a URL
	$rexUrlLinker = "{\\b$rexUrl(?=$rexTrailPunct*($rexNonUrl|$))}";

	/**
	*  List source:  http://data.iana.org/TLD/tlds-alpha-by-domain.txt
	*  Last updated: 2012-08-08
	*/
	$validTlds = hesk_array_fill_keys(explode(" ", ".ac .ad .ae .aero .af .ag .ai .al .am .an .ao .aq .ar .arpa .as .asia .at .au .aw .ax .az .ba .bb .bd .be .bf .bg .bh .bi .biz .bj .bm .bn .bo .br .bs .bt .bv .bw .by .bz .ca .cat .cc .cd .cf .cg .ch .ci .ck .cl .cm .cn .co .com .coop .cr .cu .cv .cw .cx .cy .cz .de .dj .dk .dm .do .dz .ec .edu .ee .eg .er .es .et .eu .fi .fj .fk .fm .fo .fr .ga .gb .gd .ge .gf .gg .gh .gi .gl .gm .gn .gov .gp .gq .gr .gs .gt .gu .gw .gy .hk .hm .hn .hr .ht .hu .id .ie .il .im .in .info .int .io .iq .ir .is .it .je .jm .jo .jobs .jp .ke .kg .kh .ki .km .kn .kp .kr .kw .ky .kz .la .lb .lc .li .lk .lr .ls .lt .lu .lv .ly .ma .mc .md .me .mg .mh .mil .mk .ml .mm .mn .mo .mobi .mp .mq .mr .ms .mt .mu .museum .mv .mw .mx .my .mz .na .name .nc .ne .net .nf .ng .ni .nl .no .np .nr .nu .nz .om .org .pa .pe .pf .pg .ph .pk .pl .pm .pn .post .pr .pro .ps .pt .pw .py .qa .re .ro .rs .ru .rw .sa .sb .sc .sd .se .sg .sh .si .sj .sk .sl .sm .sn .so .sr .st .su .sv .sx .sy .sz .tc .td .tel .tf .tg .th .tj .tk .tl .tm .tn .to .tp .tr .travel .tt .tv .tw .tz .ua .ug .uk .us .uy .uz .va .vc .ve .vg .vi .vn .vu .wf .ws .xn--0zwm56d .xn--11b5bs3a9aj6g .xn--3e0b707e .xn--45brj9c .xn--80akhbyknj4f .xn--80ao21a .xn--90a3ac .xn--9t4b11yi5a .xn--clchc0ea0b2g2a9gcd .xn--deba0ad .xn--fiqs8s .xn--fiqz9s .xn--fpcrj9c3d .xn--fzc2c9e2c .xn--g6w251d .xn--gecrj9c .xn--h2brj9c .xn--hgbk6aj7f53bba .xn--hlcj6aya9esc7a .xn--j6w193g .xn--jxalpdlp .xn--kgbechtv .xn--kprw13d .xn--kpry57d .xn--lgbbat1ad8j .xn--mgb9awbf .xn--mgbaam7a8h .xn--mgbayh7gpa .xn--mgbbh1a71e .xn--mgbc0a9azcg .xn--mgberp4a5d4ar .xn--o3cw4h .xn--ogbpf8fl .xn--p1ai .xn--pgbs0dh .xn--s9brj9c .xn--wgbh1c .xn--wgbl6a .xn--xkc2al3hye2a .xn--xkc2dl3a5ee0h .xn--yfro4i67o .xn--ygbi2ammx .xn--zckzah .xxx .ye .yt .za .zm .zw"), true);

	$html = '';

    $position = 0;
    while (preg_match($rexUrlLinker, $text, $match, PREG_OFFSET_CAPTURE, $position))
    {
        list($url, $urlPosition) = $match[0];

        // Add the text leading up to the URL.
        $html .= substr($text, $position, $urlPosition - $position);

        $linkText    = $match[0][0];
        $scheme      = $match[1][0];
        $username    = $match[2][0];
        $password    = $match[3][0];
        $domain      = $match[4][0];
        $afterDomain = $match[5][0];

        // Check that the TLD is valid or that $domain is an IP address.
        $tld = strtolower(strrchr($domain, '.'));

        if (preg_match('{^\.[0-9]{1,3}$}', $tld) || isset($validTlds[$tld]))
        {
            // Do not permit implicit scheme if a password is specified, as
            // this causes too many errors (e.g. "my email:foo@example.org").
            if (!$scheme && $password)
            {
                $html .= $username;

                // Continue text parsing at the ':' following the "username".
                $position = $urlPosition + strlen($username);
                continue;
            }

            if (!$scheme && $username && !$password && !$afterDomain)
            {
                // Looks like an email address.
                $completeUrl = "mailto:$url";
                $linkText = $url;
            }
            else
            {
                // Prepend http:// if no scheme is specified
                $completeUrl = $scheme ? $url : "http://$url";
            }

            $linkHtml = '<a href="' . $completeUrl . '">' . $linkText . '</a>';

            // Cheap e-mail obfuscation to trick the dumbest mail harvesters.
            $linkHtml = str_replace('@', '&#64;', $linkHtml);

            // Add the hyperlink.
            $html .= $linkHtml;
        }
        else
        {
            // Not a valid URL.
            $html .= $url;
        }

        // Continue text parsing from after the URL.
        $position = $urlPosition + strlen($url);
    }

	// Add the remainder of the text.
	$html .= substr($text, $position);
	return $html;
} // End hesk_makeURL()
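
To see why this version keeps the article number, the path and query sub-patterns can be tried in isolation. The sketch below copies `$rexPath` and `$rexQuery` from the function above; the anchored wrapper pattern and the variable names `$pattern`, `$path` and `$query` are assumptions made for illustration (the real function embeds these pieces in `$rexUrlLinker`).

```php
<?php
// Sub-patterns copied verbatim from hesk_makeURL() above.
$rexPath  = '(/[!$-/0-9:;=@_\':;!a-zA-Z\x7f-\xff]*?)?';
$rexQuery = '(\?[!$-/0-9:;=@_\':;!a-zA-Z\x7f-\xff]+?)?';

// ASSUMPTION: an anchored test pattern used only for this illustration.
$pattern = '{^' . $rexPath . $rexQuery . '$}';

// The part of the sample URL that follows the domain name.
$afterDomain = '/support/knowledgebase.php?article=16';

$path  = '';
$query = '';
if (preg_match($pattern, $afterDomain, $m)) {
    $path  = $m[1]; // everything up to, but not including, the '?'
    $query = $m[2]; // the '?' and the query string that follows it
}

echo $path, "\n";  // /support/knowledgebase.php
echo $query, "\n"; // ?article=16
```

Because `?` is not in the path character class, the path stops at the question mark and the separate query group picks up the rest, so the full URL ends up in `$match[0]` and therefore in the link text.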

Post by **srumberg** » Mon Feb 18, 2013 3:27 pm

Thank you very much for the info and posting the code.

One suggestion might be to put the array of TLDs, which is dynamic data subject to change (such as when new TLDs are released), into a common file such as hesk_settings.inc.php so that the values can be updated without having to touch the script code itself. In other words, separate dynamic data/variables from the code for ease of maintenance. I'm new to your system so I'm not familiar with your code yet and you've probably already considered this, but just wanted to mention it.

Thanks again
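
A rough sketch of how that separation might look. Everything here is hypothetical: the `valid_tlds` settings key and the `example_*` function names do not come from HESK, and PHP's built-in array_fill_keys() stands in for the hesk_array_fill_keys() helper used in the posted function.

```php
<?php
// HYPOTHETICAL settings entry; in practice it would live in a settings file
// so the list can be edited without touching the parsing code.
$hesk_settings = array();
$hesk_settings['valid_tlds'] = '.com .net .org .uk';

// Turn the space-separated list into a lookup table keyed by TLD.
function example_tld_lookup($tldList)
{
    return array_fill_keys(explode(' ', $tldList), true);
}

// Same check the posted function performs: take the text from the last dot
// onward, lowercase it, and look it up in the table.
function example_has_valid_tld($domain, $validTlds)
{
    $tld = strrchr($domain, '.');
    if ($tld === false)
    {
        return false; // no dot at all, so no TLD to check
    }
    return isset($validTlds[strtolower($tld)]);
}

$validTlds = example_tld_lookup($hesk_settings['valid_tlds']);

var_dump(example_has_valid_tld('www.Domain.COM', $validTlds));  // bool(true)
var_dump(example_has_valid_tld('example.invalid', $validTlds)); // bool(false)
```

Keying the table by TLD keeps each lookup a single isset() call, however long the list grows.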

Post by **srumberg** » Wed Feb 27, 2013 10:31 am

FYI, I replaced the hesk_makeURL() in version 2.4.2 entirely with the code Klemen so kindly posted above and it solved the problem. In other words, you don't need to wait for 2.5 to use the updated function.

Post by **Klemen** » Wed Feb 27, 2013 3:52 pm

Yes, this can be used with 2.4.2 as it doesn't use any new settings or functionality of 2.5.

Glad to hear that solved it for you.

PHPJunkyard support forum
