Agile Methodology vs classical development
- Continuous testing of the application / automated regression testing
- Only implement the features that the client needs
- Work in short, monthly cycles
- Actual features/working code over documentation
- Always have a shippable version (especially at the end of each cycle)
- Always keep a prioritized list of features and implement the first feature on the list
- Develop a reliable method of estimating (like planning games)
- No plan is set in stone; you adapt to the client as you go along.
General Tips
Amazon EC2 AMI quick LAMP install
yum install -y php-mysql php-xml php-mcrypt php-mbstring php-gd
yum install -y php-pear
yum install -y mysql-server
yum install -y php-devel
yum install php-pecl-apc
yum install -y php-pear php-devel httpd-devel
chkconfig --levels 235 httpd on
chkconfig --levels 235 mysqld on
service mysqld start
service httpd start
echo max_allowed_packet=16M >> /etc/my.cnf
echo delayed_insert_timeout=10000 >> /etc/my.cnf
echo connect_timeout=100000 >> /etc/my.cnf
Note: if you need pretty URLs, make sure that mod_rewrite is enabled (it is enabled by default on CentOS; on Ubuntu/Debian, enable it with a2enmod rewrite). Then make sure the httpd config /etc/httpd/conf/httpd.conf allows overrides (AllowOverride All). Then, if I need drush, I install it with the following 2 lines:
pear channel-discover pear.drush.org
pear install drush/drush
Installing an SSL host
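A minimal sketch for Apache on CentOS with mod_ssl; the domain and paths are placeholders, and the self-signed certificate is only a stand-in for a CA-signed one:

yum install -y mod_ssl openssl
# self-signed certificate for testing (use a CA-signed one in production)
openssl req -x509 -nodes -days 365 -newkey rsa:2048 \
  -keyout /etc/pki/tls/private/example.com.key \
  -out /etc/pki/tls/certs/example.com.crt
# minimal SSL virtual host
cat > /etc/httpd/conf.d/example.com-ssl.conf <<'EOF'
<VirtualHost *:443>
    ServerName example.com
    DocumentRoot /var/www/example.com
    SSLEngine on
    SSLCertificateFile /etc/pki/tls/certs/example.com.crt
    SSLCertificateKeyFile /etc/pki/tls/private/example.com.key
</VirtualHost>
EOF
service httpd restart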
Create a Linux swap file
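A minimal sketch, assuming a 1 GB swap file at /swapfile (both size and path are examples):

dd if=/dev/zero of=/swapfile bs=1M count=1024   # 1 GB of zeroes
chmod 600 /swapfile                             # swap files should not be world-readable
mkswap /swapfile                                # format as swap
swapon /swapfile                                # enable it now
echo '/swapfile swap swap defaults 0 0' >> /etc/fstab   # survive reboots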
Useful shell one-liners
- Get the current external IP address: GET checkip.dyndns.org | tr -d ' /:-z'
- Get an estimate of the Postfix mail queue length: mailq|wc -l
- Find out if a process is running by name: ps uxa|grep -i processname
- Get a list of directories sorted by size: du -s -h *|sort -h
- Ping a list of IPs in parallel: cat file-of-ips | xargs -n 1 -I ^ -P 50 ping ^
- Ping a list of hosts in a file: cat hosts|xargs -n1 ping -c 1
Node.js + MongoDB on CentOS
# Install the EPEL/IUS repositories from http://iuscommunity.org/
wget http://dl.iuscommunity.org/pub/ius/stable/CentOS/6/x86_64/epel-release-6-5.noarch.rpm
wget http://dl.iuscommunity.org/pub/ius/stable/CentOS/6/x86_64/ius-release-1.0-11.ius.centos6.noarch.rpm
yum localinstall epel-release-6-5.noarch.rpm
yum localinstall ius-release-1.0-11.ius.centos6.noarch.rpm
# Install MongoDB:
yum install mongodb-server
# Install build tools
yum install openssl-devel gcc-c++ gcc
# Download node.js
wget http://nodejs.org/dist/node-latest.tar.gz
tar xzvf node-latest.tar.gz
# Change to the node directory - version 0.10.5 as of this writing;
# your actual directory name may be slightly different
cd node-v0.10.5
# Build node
./configure
make
make install
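A quick sanity check afterwards (the mongod service name comes from the EPEL package and may differ on your system):

node -v               # should print the freshly built version
service mongod start  # start MongoDB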
Use version control
- Source code history - no matter how good a developer you are, there are times when you need to go back to a previous state. By committing incrementally and committing often, you have multiple states to which you can revert, and reverting is fast
- Multiple branches for multiple features - you can work on different features in different branches. Experiment all you want and even compare different versions of the same code easily by switching between branches.
- Auto-push to the production server - you can push code to the production server easily whenever you feel there is a positive change. Not only can the process be made easier, but because it is less of a hassle, you will be encouraged to commit more often. Further, when changes are made on the server, it is easier to incorporate them back transparently. It is also less risky because you can always revert
- It helps with documentation - if you add a meaningful message with every commit, you can track what you have done more easily and remember it later
- You can get a feel for progress - not only do you get a list of commits, but many metrics (such as the number of commits) can be used to get a feel for how you are doing
- Custom code - many times code needs to be developed for a specific client. Other times, you are extending an existing open source project... the custom code can be merged easily
- Better disaster recovery - because it is so easy to push to different servers, there is no single central code repository... every push to a remote repository is effectively a backup. Even if a hard drive dies, either on the server or on the development machine, recovery is nearly instant
- Code sharing - everyone can get the latest code. They can also check out each other's branches and help each other without messing up their own code or having to be on your server
- Conflict resolution - sometimes two people will change the same code and a potential conflict will be introduced... the software can flag these changes and prompt you to merge everything or only part of it.
- Team awareness - if a developer is not available, another can always pick up in case of emergency, knowing what was changed and each file's history
- Robustness - the application ends up more robust because it is integrated more often and conflicts are identified early.
- Increased productivity - when working in isolation, team members will often develop the same code, either knowingly because they need a working version or unknowingly because they do not know what others have done. Working this way, not only is this avoided, but a fresh pair of eyes on each other's code can make things more robust.
- Flexibility in code sharing - since there can be more than one repository, it is easy to imagine a tree of repositories where code is shared among the team and merges are done only when needed. Sensitive parts can be kept out of the main repository while still allowing collaboration among the people who need them
- Better problem handling - when working in a team, many problems can arise. Whether it is the rare malicious intent or simply forgetting who did what, it is a lot easier if you can pinpoint the origin of a particular change. This can be used in a positive way too: if someone comes up with a really clever fix, it is easier to know whom to praise.
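To make some of these points concrete, a minimal git workflow sketch (the repository URL and branch names are made up):

git init myproject && cd myproject           # start tracking history
# ...add initial files...
git add -A && git commit -m "Initial import"
git checkout -b feature-login                # one branch per feature
# ...edit files...
git add -A && git commit -m "Add login form validation"   # meaningful messages double as documentation
git remote add backup ssh://user@backupserver/srv/git/myproject.git
git push backup feature-login                # every push is also an off-site backup
git checkout master && git merge feature-login            # integrate early and often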
Optimize performance
Optimization tip: Optimize database requests
Relational databases rock when it comes to extracting data, but they are very resource hungry. Here are a few things you can do:
- Indices - I have seen cases where adding a few indices turned a site crawling on its knees into a fast, responsive web site
- Cache queries in computed/ready-to-serve form
- Examine the possibility of using flat files or non-relational (NoSQL) databases... especially if the data is read-only, has a simple format, and the queries rarely change
- Consider grouping multiple insert statements into one if your database supports it
- Consider optimizing queries - if, for example, you do not need a join, do not include it
- Consider using transactions when making multiple database requests... all the requests are sent at once. I once cut the import time of a huge database by at least an order of magnitude by wrapping multiple queries in a transaction (see the sketch after this list)
- Consider database replication - sometimes, by having more than one replicating server, you can improve performance. This works especially well on intranets where users are grouped in a limited number of sites.
- Consider stored procedures... they increase load on the database server but decrease the data sent back and forth between the application and the DB server. Also, the work happens on the server, directly on the raw data.
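A sketch of the index and transaction tips above, fed to the mysql command-line client (database, table, and column names are made up):

mysql -u root -p mydb <<'EOF'
-- index a column that shows up in WHERE clauses
CREATE INDEX idx_orders_customer ON orders (customer_id);

-- wrap many statements in one transaction so they are committed together
START TRANSACTION;
INSERT INTO orders (customer_id, total) VALUES (1, 9.99);
INSERT INTO orders (customer_id, total) VALUES (2, 19.99);
COMMIT;

-- grouping several rows into a single INSERT also helps
INSERT INTO orders (customer_id, total) VALUES (3, 4.99), (4, 14.99);
EOF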
Optimization tip: Consider a daemon process
Sometimes you need to access data or resources that have a long initialization process. By offloading the work to a daemon that stays in memory and optimizes calls, queries that used to take a long time can become nearly instantaneous.
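A toy illustration in shell, assuming ncat is available (the slow initialization and the port number are stand-ins):

# pay the long initialization exactly once, at daemon startup
slow_init() { sleep 30; echo "precomputed answer"; }   # stands in for the real init
DATA=$(slow_init)
# then answer every request from memory, instantly
while true; do
  echo "$DATA" | ncat -l 127.0.0.1 9999
done

A client asking with ncat 127.0.0.1 9999 then gets the answer immediately instead of paying the initialization cost on every query.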
Optimization tip: Consider a cron job
Some tasks, such as caching or prefetching data, can be offloaded to a cron job that runs at regular intervals, preferably at off-peak times.
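For instance, a crontab entry along these lines (URL and cache path are made up) could refresh a cached copy of an expensive page every night at an off-peak hour:

15 3 * * * curl -s http://localhost/landing.php > /var/cache/mysite/landing.html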
Optimization tip: Consider using specialized servers
Search engines are a good example: some machines crawl, others index the data, and others serve it... If you have limited web resources, such as a VPS, consider running some processes at home, where having a powerful machine costs you nothing extra.
Optimization tip: Optimize code
- Consider a JIT compiler or a compiler for interpreted languages. Compilers such as HipHop for PHP allow huge performance gains and a smaller memory footprint. Sometimes it may make sense to rewrite one part of the code in a lower-level language such as C
- Delay framework usage for simple stuff... frameworks allow very cool things such as MVC and simplify code reuse, but they have a performance cost
- See if you can optimize the algorithms to use fewer resources. There are times where object orientation is more appropriate and allows you to swap algorithms in and out on the fly
- Prune out unused code regularly
- Make sure there are no race conditions, and identify places where you can speed up function calls. Once, an application was very slow simply because it repeatedly resolved the same hostnames multiple times. By identifying the duplicate calls and reusing the results, the site became much faster (see the sketch after this list)
- Check whether network factors such as latencies and error rates need improvement (once, a whole site was brought down by a faulty Ethernet cable)
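A bash sketch of the duplicate-call fix from that anecdote (the lookup function and cache are illustrative; requires bash 4+ for associative arrays):

declare -A dns_cache
lookup() {
  local host=$1
  if [[ -z ${dns_cache[$host]} ]]; then
    # the slow network call now happens only once per hostname
    dns_cache[$host]=$(dig +short "$host" | head -n1)
  fi
  echo "${dns_cache[$host]}"
}
lookup example.com   # first call hits the network
lookup example.com   # answered from the cache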
Optimization tip: Load balancing
- Consider load balancing if necessary - when more traffic comes, it sometimes makes sense to spread the work over more than one server. Many setups exist, among them clusters and round-robin solutions. Having more than one server available has the added benefit of allowing failover if one server goes down, and of scaling more easily by adding servers as needed.
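As one concrete shape this can take, a minimal round-robin reverse proxy with nginx (the backend server names are placeholders):

cat > /etc/nginx/conf.d/balancer.conf <<'EOF'
upstream backend {
    server app1.example.com;    # requests alternate between
    server app2.example.com;    # these two application servers
}
server {
    listen 80;
    location / {
        proxy_pass http://backend;
    }
}
EOF
nginx -s reload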
Optimization tip: cache content
RSS feeds, databases, and fetched data like social network interactions can bring a lot to your site... but fetching them on every request can really slow it down. Make sure the cache refreshes often enough, yet as rarely as possible (in some cases data can be cached for days), either in a simple way or through specialized processes like memcached. This holds particularly true for anonymous access to the landing page, which often contains non-user-specific content.
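Even a simple file-based cache at the shell level illustrates the idea (feed URL, cache path, and the ten-minute lifetime are examples):

CACHE=/var/cache/mysite/feed.xml
# refetch only if the cached copy is missing or older than 10 minutes
if [ ! -f "$CACHE" ] || [ -n "$(find "$CACHE" -mmin +10)" ]; then
  curl -s http://example.com/feed.rss > "$CACHE"
fi
cat "$CACHE"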
Optimization tip: Alternative servers
Servers like nginx can instantly boost your performance (though, to be fair, Apache has made efforts to catch up). Consider alternatives for all server processes.
Optimization tip: Consider different/simpler programs
Some great software is out there which can do tons of stuff... but at the cost of added resources. If all you need is a picture gallery, consider a simple gallery program and not a big, complicated CMS that happens to have a picture feature.
Optimization tip: Consider static pages
Many times, dynamic pages are necessary... but sometimes a simple HTML file will do, perhaps offloading the dynamic part to a script.
Optimization tip: Separate static content
It is good to host static content on a separate server. Even if your main server is under heavy load, the images and CSS will still load fast.
Optimization tip: Only run the services needed
On development machines, it is tempting to enable tons of services for fun and wow factor... on production machines, run only what is needed. Not only will your server be more responsive, but your installation becomes more secure, with fewer security holes.
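On a CentOS-style machine, for example, auditing and trimming boot-time services might look like this (bluetooth is just an example of a service you may not need):

chkconfig --list | grep ':on'   # what currently starts at boot
chkconfig bluetooth off         # do not start it on the next boot
service bluetooth stop          # stop it right away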
Optimization tip: Optimize images, CSS, preload
- Order of HTML matters; make sure you load the important parts of the page first.
- If you have many images, preload them if possible; it will make your site feel more responsive
- Compress and/or optimize the CSS if you can; the same goes for images (a batch sketch follows this list)
- Consider offloading work to client-side JavaScript for a more responsive UI (things like validation, dependent combos, auto-complete)
- Consider partially reloading the page with AJAX... more responsive, less traffic
- With complex JavaScript, make sure your UI stays responsive; only do the needed work up front or the page may become unresponsive
- If possible, avoid multiple copies of the same file (image, music, etc.) across pages... it consumes more bandwidth, makes the site less responsive, and makes maintenance harder
- If your page will take a long time to load, make sure at least something displays
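One possible batch pass over an image directory before deployment (it assumes the jpegoptim and optipng utilities are installed; the path and quality value are examples):

find ./images -name '*.jpg' -exec jpegoptim --max=85 {} \;   # re-encode JPEGs above quality 85
find ./images -name '*.png' -exec optipng -quiet {} \;       # losslessly shrink PNGs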
Building your reputation
When you work as a consultant, especially if you are independent, it is good to look at the long run, so a few things can help:
- Have a methodology and do your homework. If you come in for a meeting, come with questions to ask, maybe a mock-up screen of what you think the product will look like. It will encourage discussion
- Ask for a reference/testimonial in writing after a job well done... it may be hard to keep in touch with the person years later, and it is always handy. If the customer is happy, they will be happy to oblige.
- Always provide a few extra features and make sure the client knows they were provided as an extra (obviously, get paid)
- If you provide a discount, make sure the client knows they are getting something for free
- Always take notes in a meeting; it will allow you to remember the client's ideas and concerns
In the end, it is about the perceived value of your work and keeping the customer happy.
Kloxo: park many domains at once
Document & script as much as you can
- By documenting the process, you always know what was done months ago. This is especially important in teams, where the person who set up the server may be unavailable during an emergency
- Because the process is documented, it is a lot easier the second time around. Not only do server crashes and upgrades happen, but who knows when you will need an additional server. Ideally, go further than documenting the process and script the installation... that avoids errors when reinstalling in a rush.
- In case of a problem, it covers your bases (a client or boss may forget they actually asked you to do something in the first place), and your mind is at ease because you can go back and say why things were done the way they were.
- It lets you know how long things take, so you can more easily estimate future work
- Writing it down actually commits the experience to memory better, and you can reflect on what to do better next time
- Obviously, a scripted process is even faster
- It avoids errors and omissions
- There will be no hesitation if you need to reinstall the whole thing
- If you ever get hacked (and I hope not), you are back up on a new machine almost instantly
- If the backup process is in a cron job, you will neither forget nor procrastinate; the computer does it by itself (a sample crontab entry follows this list)
- By having more than one copy, if one goes bad you can rely on any other copy. A standard practice is to keep five copies of the data, so you can revert to any previous day should there be a problem. In some cases this is not practical, for example when the data rarely changes.
- With source code, use a revision control system such as Git, CVS, or Subversion (it does not matter which one, just use one). If "it worked yesterday", you can always go back to yesterday's version
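A sketch of such a cron-driven backup with rotating copies (paths are examples; \%w expands to the day of the week, so this keeps seven rotating daily archives, a variation on the five-copy practice above):

0 2 * * * tar czf /backups/site-$(date +\%w).tar.gz /var/www/site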
KISS Principle
A good way to be productive is the KISS principle (Keep It Simple & Sweet or Keep It Simple, Stupid, depending on who you ask :) )
When you start a project, it may appear tempting to implement all the features, with buzzwords, bells, and whistles, making everything perfect. Unfortunately, this may lead to tons of delays and end up producing a very complex product that is not adapted to the real needs, especially if the product is at the proof-of-concept stage.
The KISS principle says that it is good to have a working, imperfect product deployed, get feedback, and add only the features that are needed. Structure your product into easy-to-master pieces and implement each one as simply as you can... you can always come back and improve the pieces with missing functionality afterwards.