Search code examples
memory-managementsegmentation-faultfortranallocatable-array

Fortran Allocatable Array Member of a User-Defined Type


I am stuck with segmentation fault at an allocatable array memberof a derived type in the following simple program. This segmentation fault occurs only on one machine (with Intel Fortran 14.0.3 on openSUSE) but not on the other machine (with Intel Fortran 14.0.2 on Ubuntu) that I tried. Also, if I change one of the integer parameters in the program, the program ends normally.

Could anybody reproduce the prolem? Could anybody tell me what is wrong with the code?

Below are the three source code files.

main_dbg.f90 .. whether the segmentation fault occurs or not depends on the values of n1 and n2 in this file.

PROGRAM dbg
  USE tktype
  USE mymodule, ONLY : MyClass, MyClass_constructor
  IMPLICIT NONE

  INTEGER(I4B)                :: n1,n2,n3
  TYPE(MyClass)               :: o_MyClass

  n1=23
  n2=32
  ! .. this does not work.
  ! n2=31 
  ! .. this works.
  n3 = n1*n2
  write(*,'(1X,A,I10)') 'n1=', n1
  write(*,'(1X,A,I10)') 'n2=', n2
  write(*,'(1X,A,I10)') 'n3=', n3

  o_MyClass = MyClass_constructor(n1, n2, n3) 

  call o_MyClass%destructor()
  write(*,*) '***************************'
  write(*,*) '   Normal End :)           '
  write(*,*) '***************************'

END PROGRAM dbg

strange.f90 .. segmentation fault occurs at the forall construct in this file.

!*******************************************************************
MODULE mymodule
!*******************************************************************
  USE tktype
  IMPLICIT NONE
  PRIVATE

  PUBLIC MyClass
  PUBLIC MyClass_constructor

  TYPE :: MyClass
     PRIVATE
     REAL(DP),     DIMENSION(:),     ALLOCATABLE :: arrA
     COMPLEX(DPC), DIMENSION(:,:,:), ALLOCATABLE :: arrB
   CONTAINS
     PROCEDURE :: destructor
  END TYPE MyClass

! ================================================================
CONTAINS
! ================================================================

  ! ****************************************************************
  FUNCTION MyClass_constructor(n1, n2, n3) RESULT(this)
  ! ****************************************************************
    TYPE(MyClass)                :: this
    INTEGER(I4B),    INTENT(IN)  :: n1, n2, n3
    ! local variables
    INTEGER(I4B) :: j1, j2, j3

    write(*,'(1X,A)') 'entered constructor..'

    allocate(this%arrA(n2))
    allocate(this%arrB(n1, n2, n3))

    this%arrA = 1.0_dp

    write(*,*) 'size(this%arrB,1) =', size(this%arrB,1)
    write(*,*) 'n1                = ', n1
    write(*,*) 'size(this%arrB,2) =', size(this%arrB,2)
    write(*,*) 'n2                = ', n2
    write(*,*) 'size(this%arrB,3) =', size(this%arrB,3)
    write(*,*) 'n3                = ', n3

    forall(j1=1:n1, j2=1:n2, j3=1:n3)
       this%arrB(j1,j2,j3)  = this%arrA(j2) 
    end forall

    write(*,'(1X,A)') '..leaving constructor'

  END FUNCTION MyClass_constructor


  ! ****************************************************************
  SUBROUTINE destructor(this)
  ! ****************************************************************
    CLASS(MyClass),             INTENT(INOUT) :: this

    deallocate(this%arrA)
    deallocate(this%arrB)

  END SUBROUTINE destructor

END MODULE mymodule

tktype.f90

! ********************************************************************
MODULE tktype
! ********************************************************************
!   module tktype is an extraction of module nrtype in Numerical Recipes in 
!   Fortran 90.
! ********************************************************************
  !   Symbolic names for kind types of 4-, 2-, and 1-byte integers:
  INTEGER, PARAMETER :: I4B = SELECTED_INT_KIND(9)
  INTEGER, PARAMETER :: I2B = SELECTED_INT_KIND(4)
  INTEGER, PARAMETER :: I1B = SELECTED_INT_KIND(2)
  !   Symbolic names for kind types of single- and double-precision reals:
  INTEGER, PARAMETER :: SP = KIND(1.0)
  INTEGER, PARAMETER :: DP = KIND(1.0D0)
  !   Symbolic names for kind types of single- and double-precision complex:
  INTEGER, PARAMETER :: SPC = KIND((1.0,1.0))
  INTEGER, PARAMETER :: DPC = KIND((1.0D0,1.0D0))
  !   Symbolic name for kind type of default logical:
  INTEGER, PARAMETER :: LGT = KIND(.true.)
END MODULE tktype

Below is a shell script to compile the source codes above and run the generated executable.

compile_run.sh

#!/bin/bash

ifort -v 
echo "compiling.."
ifort -o tktype.o -c -check -g -stand f03 tktype.f90
ifort -o strange.o -c -check -g -stand f03 strange.f90
ifort -o main_dbg.o -c -check -g -stand f03 main_dbg.f90
ifort -o baabaa strange.o tktype.o main_dbg.o
echo "..done"
echo "running.."
./baabaa
echo "..done"

The standard output looked as following.

ifort version 14.0.3
compiling..
..done
running..
 n1=        23
 n2=        32
 n3=       736
 entered constructor..
 size(this%arrB,1) =          23
 n1                =           23
 size(this%arrB,2) =          32
 n2                =           32
 size(this%arrB,3) =         736
 n3                =          736
./compile_run.sh: line 11: 17096 Segmentation fault      ./baabaa
..done

Edit 2016-01-30

I found that adding ulimit -s unlimited at the beginning (after #/bin/bash) of compile_run.sh prevents the segmentation fault. Are the allocatable arrays in fortran stored in stack, not in heap?


Solution

  • This may be a possible duplicate of the similar question (Segmentation fault on 2D array), where some multi-dimensional forall loop causes a problem. The OP of the linked question asked this in the Intel forum (ifort v 14.0 / 15.0 "-g" option causes segFault), and the latest reply is like the following:

    Workaround #1 is to increase stack size limit. I was successful with your test case using: ulimit -s unlimited

    Workaround #2 is to use DO loops instead of FORALL, as follows:

    Also, according to casey's comment in the linked question, this problem does not occur for ifort16, so I guess it may be a compiler issue specific for ifort14/15.


    More info (just some experiment):

    The same problem was reproduced on my computer by limiting the stack size to ulimit -s 4000 and using ifort14.0.1, and it disappeared with the -heap-arrays option. So I initially thought that there may be some automatic arrays or array temporaries of size n1 * n2 * n3, but there seems no such thing in the original code... Attaching -assume realloc_lhs or -check -warn did not help either.

    So I have made a test program that performs the same calculation using do or forall:

    program main
        implicit none
        integer, parameter :: dp  = KIND(1.0D0)
        integer, parameter :: dpc = KIND((1.0D0,1.0D0))
        type Mytype
            real(dp),     allocatable :: A(:)
            complex(dpc), allocatable :: B(:,:,:)
        endtype
        type(Mytype) :: t
        integer :: n1, n2, n3, j1, j2, j3
    
        n1 = 23
        n2 = 32
        n3 = n1 * n2   !! = 736
    
        allocate( t% A( n2 ), t% B( n1, n2, n3 ) )
    
        t% A(:) = 1.0_dp
    
        print *, "[1] do (3-dim)"
        do j3 = 1, n3
        do j2 = 1, n2
        do j1 = 1, n1
            t% B( j1, j2, j3 ) = t% A( j2 ) 
        enddo
        enddo
        enddo
    
        print *, "[2] do (1-dim)"
        do j2 = 1, n2
            t% B( :, j2, : ) = t% A( j2 ) 
        enddo
    
        print *, "[3] forall (1-dim)"
        forall( j2 = 1:n2 )
            t% B( :, j2, : ) = t% A( j2 ) 
        end forall
    
        print *, "[4] forall (3-dim)"   ! <-- taken from the original code
        forall( j1 = 1:n1, j2 = 1:n2, j3 = 1:n3 )
            t% B( j1, j2, j3 ) = t% A( j2 )
        end forall
    
        print *, "all passed."
    end program
    

    where pattern [4] corresponds to that used by OP. Limiting the stack size and compiling with no option (ulimit -s 4000 ; ifort test.f90) gives the output

     [1] do (3-dim)
     [2] do (1-dim)
     [3] forall (1-dim)
     [4] forall (3-dim)
    Segmentation fault
    

    which means that only the pattern [4] fails when -heap-arrays is not attached. Strangely, the problem disappears when arrays A and B are declared outside the derived type, i.e., the following program works with no options.

    program main
        implicit none
        integer, parameter :: dp  = KIND(1.0D0)
        integer, parameter :: dpc = KIND((1.0D0,1.0D0))
        real(dp),     allocatable :: A(:)
        complex(dpc), allocatable :: B(:,:,:)
        integer :: n1, n2, n3, j1, j2, j3
    
        n1 = 23
        n2 = 32
        n3 = n1 * n2   !! = 736
    
        allocate( A( n2 ), B( n1, n2, n3 ) )
    
        A(:) = 1.0_dp
    
        print *, "[1] do (3-dim)"
        do j3 = 1, n3
        do j2 = 1, n2
        do j1 = 1, n1
            B( j1, j2, j3 ) = A( j2 ) 
        enddo
        enddo
        enddo
    
        print *, "[2] do (1-dim)"
        do j2 = 1, n2
            B( :, j2, : ) = A( j2 ) 
        enddo
    
        print *, "[3] forall (1-dim)"
        forall( j2 = 1:n2 )
            B( :, j2, : ) = A( j2 ) 
        end forall
    
        print *, "[4] forall (3-dim)"
        forall( j1 = 1:n1, j2 = 1:n2, j3 = 1:n3 )
            B( j1, j2, j3 ) = A( j2 )
        end forall
    
        print *, "all passed."
    end program
    

    So it seems that the problem occurs only in some particular case of multi-dimensional forall loop (even without -g option), which might be using an internal temporary array on the stack (though -check -warn option gives no message). FYI, all the above patterns work with gfortran 4.8/5.2 and Oracle fortran 12.4.