Tag: hpc

Parallel Build Benchmark

Published by Silveira on 2008-11-14

You can optimizing your building times using a parallel build process.

The GNU Make supports it using the parameter –jobs=N (or -j=N), where N is the maximum number of jobs that can be realized at the same time, usually compilation jobs. By default, Make will perform just a job per time. Using -j without arguments imposes no limits on job number. There’s also the load approach using –load-average.

Here’s a benchmark I did showing four different complete builds of the Inkscape project, from one to four jobs at the same time. I used a Intel (Yonah FSB667 Mhz/2MbL2) Dual Core with 2 Gb of Ram with a common Ubuntu 8.10 and default build parameters and no additional optimizations.

chart inkscape_parallel_build.ods

Just compiling with make –jobs=2 instead of just make, almost doubles the speed of the build. As I’m using a dual core processor and the heavy compilations dominate the build process, the best result was with 2 jobs.

I had no trouble with those builds but it’s known that you can have problems such implicit dependencies among targets or memory exhaustion. There’s a good article, Optimizing Build Times Using Parallel “make”, on this subject. On the Make manual, there’s also a section on parallel excetution.

So, next time you try a make, try using make –jobs=2 and see the results. 🙂

Event Review: Comsolid

Published by Silveira on 2008-06-30

This week I did another presentation outside my city. This time it was at Maracanau in the Comsolid, a open source and digital inclusion event.

My first presentation was about ZFS filesystem and how you can take benefits from it like pooling storage and self healing. I used as base for examples my last post on it, Trying to corrupt data in a ZFS mirror.

My next talk was about OpenSolaris. We had a lot of questions and interesting about this. We burned some cds with OpenSolaris 2008.5 and also distributed others versions of OpenSolaris like Solaris 10.

And my last presentation was a quick talk about high performance computing, a short version on that I already did before.

Was a interesting event mainly because the public was composed primarily by young students with few background on TI. It was a challenge to present some new concepts like pooling storage for those who aren’t familiar with filesystems management. I tried to keep my talk as simpler as I could and focus on daily problems and showing that you can avoid them with some open source technologies.

The full album is available at http://flickr.com/photos/silveiraneto/sets/72157605632001295/.

International Free Software Forum 2008

Published by Silveira on 2008-05-10

Every year in Porto Alegre, Brazil, is placed the biggest free software event in the world. Is the International Forum on Free Software, FISL. This year the event counted with 21 countries, 257 presentations and more than 7 thousands hackers, students, developers and entrepreneurs together sharing knowledge and making friends.

FISL 2008 Theater

Just a few hours after NetBeans in Fortaleza. I was flying to a long trip to Porto Alegre (almost a entire day) to join in three events, the FISL 9.0 itself and also OpenSolaris Day Porto Alegre and Javali 2008.

Solaris Express and Coffee express
I like my coffee like my Solaris, Express. 😛 Installing a newer version during a free time in the airport.

At OpenSolaris Day I presented High Performance Computing and OpenSolaris showing an introduction about parallel computing concepts and a little bit about how to take advantage of OpenSolaris for HPC, using tools like ZFS and Dtrace for OpenMPI. Was a good presentation and I got good questions.

Audience

Me on OpenSolaris Day

After the OpenSolaris Day/Javali 2008 we all had a pizza party. I was really sick during my presentation, I’m not familiar with temperatures beyond 25Â° and that day was 8Â°.

Pizza party

Some Sun Campus Ambassadors

The presentation I prepared for FISL was “NetBeans: Beyond Java” showing a little bit how you can use NetBeans to develop using Ruby, C, C++ and others languages. I’d like to show that NetBeans is more than a Java IDE. I showed more about the Ruby and Ruby and Rails integration.

Some photos:

NetBeans on FISL

NetBeans at FISL

My second presentation on FISL was about JavaFX. This presentation was not really planned and I have just a couple of days to organize it. Fortunately I contacted the JavaFX community from openjfx project and immediately I got a lot of help to build some material. A very sincerely and special thanks for James L. Weaver who helped me immediately a lot. Thanks too to the Planet JFX community and their material.

JavaFX on FISL

Was really a good demo. I was more relaxed than in my Netbeans presentation and also I got a excellent feedback.

More photos:

OpenSolaris User Group

OLPC XO

OpenSolaris
Thirtankar Das talked about project Indiana.

Man and child using their laptops

Rafael Vanoni talking about OpenSolaris kernel scheduling.

Roger Brinkley talking about PhoneME.

high 5

Fracois Orsini, Silveira Neto and Ted Goddard
Fracois Orsini, me and Ted Goddard.

Gregg Sporar on Java memory leaks.

Raghavan “Rags” Srinivas on Java runtime.

Louis Suarez-Potts and Vitorio Y. Furusho talking. See also this excellent interview with Louis.

Ray Gans on OpenJDK.

Rich Sands on OpenJDK
Rich Sands also on OpenJDK.

Meet Sun SPOT
Gary Thompson showing a Sun SPOT vehicle.

Rafael David Tinoco on UltraSparc and OpenSparc.

Campus Party on FISL
SÃ©rgio Amadeu da Silveira, Roberto Andrade e Marcelo D’Elia Branco in a informal retrospective about Campus Party.

Marge
Lucas Bortolaso Torri and Bruno Cavaler Ghisi talking about Marge Framework.

Rich Sands, me and Eduardo Lima

Be at FISL was a dream for me for a long time and finally I could achieve this year, and more specially participating as speaker. In the other hand, I spent lot of time finishing and preparing my demos and could not completely enjoy the event itself, but was a really good event, I meet a lot of people I only knew by mails lists and also meet a lot of people from Sun’s staff.

Porto Alegre

Dinner

Porto Alegre is also a very beautiful and well preserved city though I had almost no time to see it. And if during the daytime I almost don’t ate, during the night I went to very good restaurants and churrascarias. I went back to home some kilos fatter. 😛

ps.: I took hundreds of photos. There a set of them in my Flickr.
ps. 2: I tried to put the name of all who appeared in my photos. If I did a mistake, let me know, please.
ps. 3: I had a problem with my file system and I lose those slides I presented in FISL. 🙁 The only available is High Performance Computing and OpenSolaris.

High-performance Computing and Opensolaris

Published by Silveira on 2008-04-16

Slides from the talk I did at OpenSolaris Day in Porto Alegre.

| View | Upload your own

Download: hpc_and_OpenSolaris.odp

OlÃ¡ Mundo Paralelo com MPI

Published by Silveira on 2007-08-29

MPI Ã© a sigla para Message Passing Interface, um padrÃ£o de comunicaÃ§Ã£o de dados para computaÃ§Ã£o paralela. O MPI oferece diversas abstracÃ§Ãµes que facilitam e padronizam o desenvolvimento de aplicaÃ§Ãµes paralelas. Por exemplo, vocÃª pode programar para vÃ¡rios processadores, nÃ³s de um cluster, supercomputadores ou Internet utilizando a mesma infraestrutura transparentemente.

Supercomputador Nasa
Cluster Columbia da NASA, com 1024 nÃ³s.

Como MPI Ã© um padrÃ£o, existem vÃ¡rios padrÃµes de implementaÃ§Ã£o, abertas, fechadas, comerciais ou gratuitas. MPI Ã© definido a princÃpio para C e Fortran, mas hÃ¡ implementaÃ§Ãµes em outras linguagens como Java ou Python, por exemplo. A implementaÃ§Ã£o que eu vou utilizar nesse exemplo Ã© a OpenMPI.

A notÃcia boa Ã© que vocÃª nÃ£o precisa ter um supercomputador em casa para aprender e praticar computaÃ§Ã£o paralela, uma mÃ¡quina domÃ©stica serve. Se vocÃª tiver uma mÃ¡quina com mÃºltiplos processadores, melhor ainda.

InstalaÃ§Ã£o

Para instalar um ambiente de desenvolvimento para MPI no Ubuntu Linux basta um comando:

sudo apt-get install build-essential openmpi-dev

Isso vai instalar um conjunto bÃ¡sico de compiladores e o ambiente OpenMPI.

O cÃ³digo

Vamos criar um arquivo chamado ola.c com o conteÃºdo:

#include
#include
int size, rank;
int main(int argc, char *argv[]){
   MPI_Init(&argc,&argv);
   MPI_Comm_size(MPI_COMM_WORLD,&size);
   MPI_Comm_rank(MPI_COMM_WORLD,&rank);
   printf("Oi. Eu sou o processo %d de %d\n", rank, size);
   MPI_Finalize();
}

CompilaÃ§Ã£o

Para compilar esse cÃ³digo vamos usar o comando mpicc que foi instalado junto com o pacote openmpi-dev. Ele Ã© uma interface para o gcc, e vai cuidar de toda a linkagem com as bibliotecas do MPI. VocÃª pode usar os parÃ¢metros do gcc com o mpicc.

mpicc ola.c -o ola

Se tudo der certo esse comando vai criar o binÃ¡rio ola.

ExecuÃ§Ã£o

Outra ferramenta importante Ã© o mpirun, que levantar o mpi nos diversos nÃ³s e mandar cada nÃ³ executar o binÃ¡rio. O mpirun nÃ£o precisa de um programa mpi para rodar, por exemplo, se dermos esse comando:

mpirun -np 4 echo oi

VocÃª vai ter essa saÃda:

oi
oi
oi
oi

VocÃª mandou 4 nÃ³s (-np 4) executar o comando echo oi (imprime oi). Para mandar 5 nÃ³s executarem nosso binÃ¡rio ola:

mpirun -np 5 ola

E vamos ter uma saÃda mais ou menos assim:

Oi. Eu sou o processo 1 de 5
Oi. Eu sou o processo 4 de 5
Oi. Eu sou o processo 0 de 5
Oi. Eu sou o processo 2 de 5
Oi. Eu sou o processo 3 de 5

Por que as saÃdas sairam desordenadas? Porque elas rodaram em paralelo e nÃ£o temos como saber qual foi sua ordem de execuÃ§Ã£o. Assim cada nÃ³ entrou no printf em um momento diferente e imprimiu seu rank e seu size naquele momento. VocÃª pode experimentar usar o parÃ¢metro -np com outros nÃºmeros maiores ou menores que 5.

Troca de Mensagens

AtÃ© aqui nÃ£o hÃ¡ muita graÃ§a porque nÃ£o hÃ¡ troca de mensagens. HÃ¡ muito o que se dizer sobre como trocar mensagens do MPI mas a maneira mais fÃ¡cil de se comeÃ§ar Ã© com a funÃ§Ã£o mpi_send.

Vamos fazer um programa bem simples onde o nÃ³ 0 vai mandar uma mensagem para o nÃ³ 1. A mensagem vai ser um nÃºmero, 42. Criemos um arquivo chamado msg.c com o cÃ³digo:

#include
#include

int size, rank, msg, source, dest, tag;

int main(int argc, char *argv[]){
   MPI_Status stat;

   MPI_Init(&argc,&argv);
   MPI_Comm_size(MPI_COMM_WORLD,&size);
   MPI_Comm_rank(MPI_COMM_WORLD,&rank);

	if(rank==0){
   	msg = 42; dest = 1; tag = 0;
   	MPI_Send(&msg, 1, MPI_INT, dest, tag, MPI_COMM_WORLD);
   	printf("Processo %d enviou %d para %d.\n", rank, msg, dest);
	}

	if(rank==1){
		source = 0; tag = 0;
		MPI_Recv(&msg, 1, MPI_INT, source, tag, MPI_COMM_WORLD, &stat);
		printf("Processo %d recebeu %d de %d.\n", rank, msg, source);
	}

   MPI_Finalize();
}

No processo de rank 0 vamos enviar o conteÃºdo da variÃ¡vel inteira msg para o processo de rank 1. Note que no processo de rank 1, o valor de msg nÃ£o estÃ¡ definido. O comando MPI_Send vai receber 6 parÃ¢metros.

int MPI_Send( void *buf, int count, MPI_Datatype datatype, int dest, int tag, MPI_Comm comm)

void *buf, um ponteiro para a mensagem que vocÃª vai mandar. No nosso caso a variÃ¡vel inteira msg.
int count, a quantidade de elementos que tem nessa mensagem. No nossa caso sÃ³ 1. Se quisemos mandar um vetor de dois inteiros, seria 2.
MPI_Datatype datatype, uma constante que define o tipo de dados que vocÃª estÃ¡ enviando. No nosso caso MPI_INT. Isso evita que ajam incompatibilidade no tamanho de inteiros entre arquiteturas diferentes.
int dest, o rank do nÃ³ destino, o destinatÃ¡rio. No nosso caso o nÃ³ 1.
int tag, a tag seria num email o assunto da mensagem. Estamos mandando tag 0 entÃ£o no outro lado tem que estar esperando uma tag 0, caso contrÃ¡rio nÃ£o hÃ¡ comunicaÃ§Ã£o.
MPI_Comm comm, o comunicador. Nesse e na maioria dos casos a constante MPI_COMM_WORLD.

Do outro lado, no processo 1 vamos usar o MPI_recv, que recebe 7 parÃ¢metros.

int MPI_Recv( void *buf, int count, MPI_Datatype datatype, int source, int tag, MPI_Comm comm, MPI_Status *status)

void *buf, um ponteiro para onde vai ser guardada a mensagem que vamos receber. No nosso caso a variÃ¡vel msg, que no processo 1 estÃ¡ vazia.
int count, a quantidade de elementos que vem nessa mensagem.
MPI_Datatype datatype, a mesma constante do MPI_send.
int source, o rank do nÃ³ remetente. No nosso caso o nÃ³ 0.
int tag, a tag da mensagem conforme explicado no MPI_send.
MPI_Comm comm, o comunicador.
MPI_Status *status, uma estrutura para que depois que a funÃ§Ã£o for executada vocÃª possa inspecionar detalhes da transmissÃ£o. No nosso caso ela Ã© inÃºtil.

Para compilar esse exemplo usamos novamente o mpicc.

mpicc msg.c -o msg

E para executa-lo o mpirun.

mpirun -np 2 msg

O programa vai escrever essa mensagem:

Processo 0 enviou 42 para 1.
Processo 1 recebeu 42 de 0

No processo 1 a msg estava inicialmente vazia e no processo 0 havia 42, mas depois do MPI_recv o processo 1 pode escrever o conteÃºdo 42 de msg. Logo, houve comunicaÃ§Ã£o.

Dicas

Por um problema no empacotamento do mpich no Ubuntu toda vez que vocÃª executa o MPI vocÃª recebe umas mensagens horrorosas de erro, que na verdade sÃ£o sÃ³ um aviso que ele nÃ£o encontrou uma placa de rede Infiniband.

Para vocÃª silenciar na unha essa chatice use o mpirun assim:

mpiexec –mca btl ^openib -np 1 executÃ¡vel

Onde -np 1 deve ser substituido pelo seu nÃºmero de processos e executÃ¡vel pelo seu executÃ¡vel.

Outra dica Ã© que vocÃª pode utilizar uma distribuiÃ§Ã£o Linux que jÃ¡ venha com o MPI instalado. Por exemplo o Scientific Linux ou o Parallel Knoppix.