OpenGL>Selection

Selection by pointing

What

Selection of an object in a scene by pointing at it

The question we raise is how should we identify a part of a scene by pointing at it. We can easily see that this problem is different in 2D and 3D. This module attempts to investigate this in 3 situations:

general algorithm to check if a point is inside a polygon
how to hit something in a transformed 2D environment
how to hit something in a 3D scene in OpenGL

The illustration to the right shows a identified and hilited torus in a chain of toruses.

General algorithm

In general we can solve the problem by identifying a polygon that contains our click-point. We do assume that polygons does not overlap. The polygons we test our point against should be convex and concave.

Testing a point, p, against a rectangle, r, is trivial.

 inside =(p.x < r.right)&&(p.x > r.left)&&(p.y < p.bottom)&&(p.y > p.top);

The following two methods are a sketch for a general inside test. The first, Intersection, finds possible intersections between lines. Written i c.

BOOL Intersection(
          CPoint&p1,CPoint& p2,
          CPoint&p3,CPoint& p4,
          CPoint&ps)
{
  /* finds the intersecton between lines p1-p2 and p3-p4
     return TRUE if intersection, FALSE else
     result in ps, if intersection
     use parametric equations for the lines
  */
  int dx1=p2.x-p1.x;
  int dx2=p4.x-p3.x;
  int dy1=p2.y-p1.y;
  int dy2=p4.y-p3.y;
  int n=dx2*dy1-dy2*dx1;
  if(n==0) return FALSE;
  double s=1.0*(dx1*(p3.y-p1.y)-dy1*(p3.x-p1.x))/(1.0*n);
  if ((s<0.0)||(s>1.0)) return FALSE;
  double t=1.0*(dx2*(p3.y-p1.y)-dy2*(p3.x-p1.x))/(1.0*n);
  if((t<0.0)||(t>1.0)) return FALSE;
  ps.x=(int)(dx1*t+p1.x);
  ps.y=(int)(dy1*t+p1.y);
  return TRUE;
}

The other method, Inside, may be used to test if a point is inside a polygon:

BOOL Inside(CPoint p)
{
  /* Have we hit the polygon in pointarray POL ?
     assume POL closed, POL [0]== POL [POL.GetSize()-1]
     if we have a sorrounding box for the polygon
     we should test this first
     For instance like this:
      if(!m_BigR.PtInRect(p))
            return FALSE;
  */
  // Count intersections to the left of p
  // odd is hit, even is miss
  int pix;
  CPoint p1,p2,p3,p4,ps;
  BOOL Inside=FALSE;
  p1.x=-<a number smaller than smallest x in POL >;
  p1.y=p.y;
  p2.x=<a number greater than greatest x in POL >;
  p2.y=p.y;
  for(pix=0;pix< POL.GetSize()-1;pix++)
  {
    p3= POL [pix];
    p4= POL [pix+1];
    if(Intersection(p1,p2,p3,p4,ps))
      if (ps.x < p.x )
        Inside=!Inside;
  }
  return Inside;
}

Transformed 2D

The situation is getting a little more tricky if we draw in 2D and use transformations (rotate, translate, scale) when we draw. Most drawing environments, like Java SDK and .net (C++,VB,C#). In Java we find the transformation routines in class graphics2D.

We can use the strategy described in 2D transformations, and build a matrix for drawing. When we click on a drawing that has been transformed, we must transform the click point from screen coordinates to "model-coordinates" by the inverse transformation matrix and then do the test.

3D

In 3D the reasoning gets more complicated.

The scene is transformed to a coordinate system that involves 3 D. We can still get the inverse transformation matrix, but the setting is more complex since it also involves a perspective transformation.
We will have problems deciding which component in the scene is closest to the eye. We cannot in general anticipate any relations between components regarding distance from eye.

In OpenGL this situation is resolved by a strategy that is involved in the rendering process:

The user points at the scene, clicks
The program generates a small clipping volume with centre in the mouse click
The program repeats the rendering of the scene that involves the new small clipping volume. Nothing is changed on the screen during this "extra" rendering.
During this rendering we can give names to parts of the scene that are potential hit targets. The names that are active during clipping are stored by OpenGL during the rendering.
After the extra rendering we can pick up all the names that has been noted, and we can read their z-coordinates (distance as seen from the eye)

An implementation of this strategy could be like this:

private int pickSelected(GL gl,int x,int y)
{
   GLU glu = new GLU();
  int BUFSIZE=1024;
  int hits;
  int viewport[]=new int[4];// to store current viewport
  int closestix;
  int ix;
  IntBuffer buf=BufferUtil.newIntBuffer(BUFSIZE);
  gl.glGetIntegerv(GL.GL_VIEWPORT,viewport,0);// get viewport
  gl.glSelectBuffer(BUFSIZE,buf);// tell OGL where to report
  gl.glRenderMode(GL.GL_SELECT);// as opposed to GL_RENDER
  gl.glInitNames();
  gl.glPushName(-1);
  gl.glMatrixMode(GL.GL_PROJECTION);
  gl.glPushMatrix();
  gl.glLoadIdentity();
  // set up a small clipping pyramid
  glu.gluPickMatrix(x,(viewport[3]-y),5.0f,5.0f,viewport,0);
  // same view as we see it, same as in reshape
  glu.gluPerspective(45.0f,(float)viewport[2] / (float)viewport[3],
                     1.0f, 200.0f);
  gl.glMatrixMode(GL.GL_MODELVIEW);
  //--------------------
  // draw it without rendering
  drawToruses(gl,GL.GL_SELECT);
  //----------------
  gl.glMatrixMode(GL.GL_PROJECTION);
  gl.glPopMatrix();
  gl.glMatrixMode(GL.GL_MODELVIEW);
  hits=gl.glRenderMode(GL.GL_RENDER);// gives me a hitcount
  // process the ix+1 hits
  // selectBuf contents:
  //  ix  : number of names on the hitstack when hit occurred
  //  ix+1: min z-depth of "object" associated with name, relative
  //  ix+2: max z-depth of "object" associated with name, relative
  //  ix+3: the name, ie index, assosiated with the hit
  //  etc
  if(hits ==0)
    return -1;
  else
  {
    // we have hit one or more "objects"
    // if only one we simply select it
    if(hits ==1)
       return buf.get(3);//the only one in store
    // more than one is hit.  Which should we select ?
    // The closest one,
    // go for the one which closest part is closest
    closestix=0;
    for(ix=1;ix<hits;ix++)
      if(buf.get(4*ix+1) < buf.get(4*closestix+1))
        closestix=ix;
    return buf.get(4*closestix+3);
 }
}

The display routine uses this method according to this scheme:

public void display(GLAutoDrawable drawable) {
    GL gl = drawable.getGL();
    // Enable VSync
    gl.setSwapInterval(1);
    // Clear the drawing area
    gl.glClear(GL.GL_COLOR_BUFFER_BIT | GL.GL_DEPTH_BUFFER_BIT);
    
   if(m_selecting)// we have pressed right mousebutton
       m_hilited=pickSelected(gl,(int)m_lastMouseX,(int)m_lastMouseY);
    gl.glLoadIdentity();
    gl.glTranslatef(0.0f,0.0f,-m_zoom_dist);
    gl.glRotatef(m_view_rotx, 1.0f, 0.0f, 0.0f);
    gl.glRotatef(m_view_roty, 0.0f, 1.0f, 0.0f);
    // draw the scene with normal rendering
    drawToruses(gl,m_Drawing_Mode);
    gl.glFlush();
}

References

Program which displays toruses with selection: https://svn.hiof.no/svn/psource/JOGL/hitme

Maintainance

B Stenseth, May 2009

(Welcome) OpenGL>Selection (Marching cubes)