Please refer to this private repository to find the initial work on this jira:
https://bitbucket.org/srishtitiwari/access-log-parser/src/master/
1. Please note that the current version assumes the default format of access.log in Apache web server while parsing. If the format of the access.log changes, the changes would be required in this parser as well.
2. The python interpreter used was Python 3.6.
3. To set up the script on the server, the administrator needs to set up a cron job. Please refer to the readme file.
4. The things left to do are present as comments in the code. These are as follows:
- Check if permissions exist to read the access.log file
- Change CSV to SQL LITE or any other file-based database
- Change the location of the generated file (undecided)
- Setup build pipeline for this project
Thanks,
Srishti
Queries:
When does the user access http://bioviz.org/igb/releases/current/updates.xml file? (Click event or what triggers it) Exact location or page on website would help!
Do we need to give the count of users who access it?
Does accessing the above link means that the user is downloading IGB?
Do we need to go with Google Analytics ( Tools are available but paid ) or can we go with Command line tool for log analysis?
Is Google Analytics already being used for bioviz site?
Prof [~aloraine] Could you clarify the above queries?