Building CLARIN Infrastructure: from text images or files to online corpus