Warning: mysqli::__construct(): (HY000/1203): User howardkn already has more than 'max_user_connections' active connections in D:\Inetpub\vhosts\howardknight.net\al.howardknight.net\includes\artfuncs.php on line 21
Failed to connect to MySQL: (1203) User howardkn already has more than 'max_user_connections' active connections
Warning: mysqli::query(): Couldn't fetch mysqli in D:\Inetpub\vhosts\howardknight.net\al.howardknight.net\index.php on line 66
Article <20240319191859.00004bc8@yahoo.com>
Deutsch   English   Français   Italiano  
<20240319191859.00004bc8@yahoo.com>

View for Bookmarking (what is this?)
Look up another Usenet article

Path: ...!eternal-september.org!feeder3.eternal-september.org!news.eternal-september.org!.POSTED!not-for-mail
From: Michael S <already5chosen@yahoo.com>
Newsgroups: comp.lang.c
Subject: Re: filling area by color atack safety
Date: Tue, 19 Mar 2024 19:18:59 +0200
Organization: A noiseless patient Spider
Lines: 217
Message-ID: <20240319191859.00004bc8@yahoo.com>
References: <ut3669$21eur$1@i2pn2.org>
	<86h6h3nvyz.fsf@linuxsc.com>
	<865xxiok09.fsf@linuxsc.com>
	<20240319131842.00002138@yahoo.com>
	<utbuk3$qed7$1@dont-email.me>
MIME-Version: 1.0
Content-Type: text/plain; charset=US-ASCII
Content-Transfer-Encoding: 7bit
Injection-Info: dont-email.me; posting-host="a8871862f6b018cf47b195a984142ca8";
	logging-data="789293"; mail-complaints-to="abuse@eternal-september.org";	posting-account="U2FsdGVkX1+kp+Di46e5lCY2QMue68IoZwO080PXJ08="
Cancel-Lock: sha1:Dfc5H6MgFhcJZcVM0uLdlPxjbQQ=
X-Newsreader: Claws Mail 3.19.1 (GTK+ 2.24.33; x86_64-w64-mingw32)
Bytes: 8366

On Tue, 19 Mar 2024 11:57:53 +0000
Malcolm McLean <malcolm.arthur.mclean@gmail.com> wrote:

> On 19/03/2024 11:18, Michael S wrote:
> > On Mon, 18 Mar 2024 22:42:14 -0700
> > Tim Rentsch <tr.17687@z991.linuxsc.com> wrote:
> >   
> >> Tim Rentsch <tr.17687@z991.linuxsc.com> writes:
> >>
> >> [...]
> >>
> >> Here is the refinement that uses a resizing rather than
> >> fixed-size buffer.
> >>
> >>
> >> typedef     unsigned char               Color;
> >> typedef     unsigned int                UI;
> >> typedef     struct { UI x, y; }         Point;
> >> typedef     unsigned int                Index;
> >>
> >> static  _Bool  change_it( UI w, UI h, Color [w][h], Point, Color,
> >> Color );
> >>
> >> void
> >> fill_area( UI w, UI h, Color pixels[w][h], Point p0, Color old,
> >> Color new ){ static const Point deltas[4] =  {  {1,0}, {0,1},
> >> {-1,0}, {0,-1},  }; UI     k      =  0;
> >>    UI     n      =  17;
> >>    Point *todo   =  malloc( n * sizeof *todo );
> >>
> >>      if(  todo  &&  change_it( w, h, pixels, p0, old, new )  )
> >> todo[k++] = p0;
> >>
> >>      while(  k > 0  ){
> >>          Index j = n-k;
> >>          memmove( todo + j, todo, k * sizeof *todo );
> >>          k = 0;
> >>
> >>          while(  j < n  ){
> >>              Point p = todo[ j++ ];
> >>              for(  Index i = 0;  i < 4;  i++  ){
> >>                  Point q = { p.x + deltas[i].x, p.y + deltas[i].y
> >> }; if(  ! change_it( w, h, pixels, q, old, new )  )
> >> continue; todo[ k++ ] = q;
> >>              }
> >>
> >>              if(  j-k < 3  ){
> >>                  Index new_n = n+n/4;
> >>                  Index new_j = new_n - (n-j);
> >>                  Point *t = realloc( todo, new_n * sizeof *t );
> >>                  if(  !t  ){  k = 0;  break;  }
> >>                  memmove( t + new_j, t + j, (n-j) * sizeof *t );
> >>                  todo = t,  n = new_n,  j = new_j;
> >>              }
> >>          }
> >>      }
> >>
> >>      free( todo );
> >> }
> >>
> >> _Bool
> >> change_it( UI w, UI h, Color pixels[w][h], Point p, Color old,
> >> Color new ){ if(  p.x >= w  ||  p.y >= h  ||  pixels[p.x][p.y] !=
> >> old  ) return  0; return  pixels[p.x][p.y] = new,  1;
> >> }  
> > 
> > This variant is significantly slower than Malcolm's.
> > 2x slower for solid rectangle, 6x slower for snake shape.
> > Is it the same algorithm?
> >   
> 
> No. Mine takes horizontal scan lines and extends them, then places
> the pixels above and below in a queue to be considered as seeds for
> the next scan line. (It's not mine, but I don't know who invented it.
> It wasn't me.)
> 
> Tim, now what does it do? Essentially it's the recursive fill
> algorithm but with the data only on the stack instead of the call and
> the data. And todo is actually a queue rather than a stack.
> 
> Now why would it be slower? Probaby because you usually only hit a
> pixel three times with mine - once below, once above, and once for
> the scan line itself, whilst you consider it 5 times for Tim's - once
> for each neighbour and once for itself. Then horizontally adjacent
> pixels are more likely to be in the same cache line than vertically
> adjacent pixels, so processing images in scan lines tends to be a bit
> faster.
> 

I did a little more investigation gradually modifying Tim's code for
improved performance without changing the basic principle of the
algorithm. Yes, micro-optimization. Yes, I said earlier that doing so
in c.l.c it is bad sportsmanship. So what? I never claimed to be an
ideal sportsman.
The point is that after optimizations it's actually faster than the
best implementations of original recursive algorithm, including
implementation that uses explicit stack and is quite economical in its
memory consumption. Tim's algorithm is 8 times less economical (8 bytes
per saved node vs 1 byte in explicit stack) and nevertheless almost
twice faster for both shapes that I was testing.
So far, this algorithm is fastest among all "local" algorithms that I
tried. By "local" I mean algorithms that don't try to recolor more than
one pixel at time.
"Non-local" algorithms i.e. yours and my recursive algorithm that
recolors St. George cross, are somewhat faster, but I suspect that
it's because all shapes that I use for testing have either long
columns or long rows or both.
The nice thing about Tim's method is that we can expect that
performance depends on number of recolored pixels and almost nothing
else. 
The second nice thing is that it is easy to understand. Not as easy as
original recursive method, but easier than the rest of them.

If you or somebody else is interested, here is [micro]optimized variant:


#include <stdlib.h>
#include <stddef.h>
#include <string.h>


typedef unsigned char        Color;
typedef int                  UI;
typedef struct { UI x, y; }  Point;

static inline
Point* circularIncr(Point* p, Point* beg, Point* end) {
  return p + 1 == end ? beg : p + 1;
}

static inline
Point mk_point(int x, int y) {
  Point pt={x,y};
  return pt;
}

int floodfill_r(
  Color *pixels,
  int w,     int h,
  int pt0_x, int pt0_y,
  Color old, Color new)
{
  if (pt0_x < 0 || pt0_x >= w || pt0_y < 0 || pt0_y >= h)
    return 0;

  if (pixels[pt0_y*w+pt0_x] != old)
    return 0;

  pixels[pt0_y*w+pt0_x] = new;

  const ptrdiff_t INITIAL_TODO_SIZE = 125;
  Point *todo =  malloc( (INITIAL_TODO_SIZE+3) * sizeof *todo );
              // +3 is extra size to assist wrap-around of wr 
  if (!todo)
    return -1;
  Point* todo_end = &todo[INITIAL_TODO_SIZE];

  todo[0] = mk_point(pt0_x, pt0_y);
  Point* wr = &todo[1];
  Point* rd = todo;
  ptrdiff_t free_space = INITIAL_TODO_SIZE - 1;
  do {
    Point pt = *rd;
    rd = circularIncr(rd, todo, todo_end);
    Point* prev_wr = wr;
    if (pt.x > 0 && pixels[pt.y*w+pt.x-1] == old) {
      pixels[pt.y*w+pt.x-1] = new;
      *wr++ = mk_point(pt.x-1, pt.y);
    }
    if (pt.y > 0 && pixels[pt.y*w+pt.x-w] == old) {
      pixels[pt.y*w+pt.x-w] = new;
      *wr++ = mk_point(pt.x, pt.y-1);
    }
    if (pt.x+1 < w && pixels[pt.y*w+pt.x+1] == old) {
      pixels[pt.y*w+pt.x+1] = new;
      *wr++ = mk_point(pt.x+1, pt.y);
    }
    if (pt.y+1 < h && pixels[pt.y*w+pt.x+w] == old) {
========== REMAINDER OF ARTICLE TRUNCATED ==========