[users@httpd] Does Module exists for manipulating html text?

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 




I've tried everything at this point and I'm hoping the guru's of the apache list might be able to help!  

We have an interesting request from our client who wants us to remove comments and _javascript_ from their web pages during a request.  Their requirement is that the page size be 6K or less.  because of the dynamic programs they use to build these pages the size becomes about 25K.  They want to dynamically update validation and finally production with all the same content.  They want the end users to request the page and have our web servers filter the response and remove all the extra junk from the page.  

To get this to work I used the ext_filter module and a compiled C program that uses regular expressions to remove all the junk.
ExtFilterDefine  replace cmd="ssFormatter"

The problem with this is that the program is called outside of the Apache process and creates a lot of overhead.  This site gets about a million + hits a day.  Currently we are serving about 800-1000pages/sec on each web server.  After loadtesting with this new filter in place we averaged 11pages/sec.  

Does a  module exist today that has the functionality to rewrite the html response?
Any other ideas out there besides ExtFilterDefine?

Any help would be greatly appreciated.  Thanks everyone!  


Patrick Presto
IT Advisory Specialist
IBM Global Services
720-540-1082

[Index of Archives]     [Open SSH Users]     [Linux ACPI]     [Linux Kernel]     [Linux Laptop]     [Kernel Newbies]     [Security]     [Netfilter]     [Bugtraq]     [Squid]     [Yosemite News]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Linux RAID]     [Samba]     [Video 4 Linux]     [Device Mapper]

  Powered by Linux