How to deal with long argument list

In summary: I'm assuming global variables are bad if you can avoid them, but I'm not really sure where to start.In summary, global variables can be useful if their usage is documented and their scope is limited to the function in which they are used.
  • #1
icwchan
6
0
Hi everyone,

So I'm familiar with the popular advice that global variables should be avoided whenever possible, but I'm wondering how seriously people in scientific programming obey this rule of thumb.

I am current engaged in writing some code for semiconductor transport, and I'm finding that I require a lot of data that needs to accessed practically everywhere in the program (eg. for example, I have a set of wavefunctions that are used in a multitude of different calculations.) I tried avoiding the global route at first, but the argument lists for my subroutines were getting obscenely long (I stopped at around 15 for one of the functions). Finally exasperated with the constant parameter list fiddling and debugging, I gave up and bundled everything up into a (almost) universally accessible module (in case you were wondering, I'm doing the heavy number crunching in Fortran 90 with f2py for use in a python framework).

I'm sure I'm not the first person to encounter this problem, but most of my googling turns up results from CS guys who think global variable users should be drawn and quartered; I hope it's okay to ask for a second opinion here, hopefully from someone experienced in scientific programming... What's the usual approach to passing around large quantities of data internally in a program?

Thank you in advance!

IC
 
Technology news on Phys.org
  • #3
Thanks for the quick reply, but I have no troubles understanding *how* to use global variables. My issue is *when*, if ever, can I or should I use them. The website you linked only addresses the former.

More specifically, how does one handle situations in which large quantities of data need to be accessed all over a program? Is massive parameter lists really the way to go? Or do people just leaving it floating around and readily accessible, like in a MATLAB script?
 
  • #4
Instead of having an argument list with a whole raft of parameters, have one parameter of type STRUCTURE. It's been awhile since I've done any programming in Fortran, and I don't remember writing functions/subroutines with STRUCTURE parameters. That's how it's done in C and C++ and related languages when you want to pass a lot of information into a function and don't want to have to hassle with a bunch of parameters.

In general, it's considered better practice to pass parameters to a function rather than use global variables.
 
  • #5
I don't see an issue with truly global variables if their usage by functions is documented. One advantage is that you can do a text search to find all references to a global variable in a set of source files. In the case of a passed parameter, the function's local name for the variable may be different, making it more difficult to locate all references to a specifc variable that is global in nature. I would recommend you keep all actual instances of global variable declarations in a single (or as few as possible) source module(s), rather than having them scattered throughout set of files, but it seems you've already done this.

If the issue is a long argument list as opposed to a variable that is truly global in nature, then as suggested, using a structure would be a good idea.
 
Last edited:
  • #6
If you're using python, args and kwargs clean up a lot of the really long function defs: Langauge Reference

And you can pass dictionaries into functions if you use the same keywords the function does, so if the function is maths(sum,total) and vars = {'total':12,'sum':8}, maths(vars) will work the same as maths(8,12)

My latest approach to this problem (something like 20 params as of now) is to bundle up my data as an object and to pass the object around to the various functions that need the parameters. I guess this is the python equivalent of using structs.

My old method (and I still use it for some testing purposes) is to build a dictionary of my parameters and then call a function to grab the dictionary whenever I need values, so:
Code:
def getparams():
    pars = {'par1':...
               ...
              'parn':...}
    return pars
and then whenever I need to pull something, it's just
pars = getparams()
 
Last edited:
  • #7
If you are dealing with constants e.g. the number PI or the speed of light etc. then you don't pass that as an argument into a function. Such constants are always global or part of some object/class that can be accessed globally.
Global constants are perfectly fine. It's just global variables that should be avoided if possible. But of course there are also cases where global variables are the best option. E.g. in a game the data structure that represents the "world" is usually global.
 
  • #8
Someone correct me if I'm wrong, but the impression I'm taking away from this is that with some caveats, I do not need to avoid globals like the plague (eg. okay for "world" data, keep it all in one file etc). Which is good, since that is the answer I was hoping for. Or I can try to pass around a large custom structure (not sure if f2py handles that very well, however, but will check...)

Thank you to everyone for their response, I appreciate it.
 
  • #9
In many places, you absolutely do need to avoid globals like the plague. Globals rank right up there with "go to" in terms of "things to avoid at all costs" (aka "things that require a waiver from high-level management").

It sounds like you are not using any concepts from object oriented programming at all. If you are using anything that resembles a modern programming language you should at a bare minimum be using encapsulation techniques.
 
  • #10
As DH mentioned, it depends on the group of people you work with or the authors of CS books and their standards (if there are any). Some groups treat programming more like a religion than a science, and it's in these groups where you find that globals and goto's are to be avoided. One of the things you typically do at a new programming job is to find out what the groups standards are, which usually involves naming and documentation conventions. There may be actual written standards, or you can simply ask others in the group and browse the existing code.

Getting back to the OP, Fortran 90 supports structures, so you could use those for parameters that can be logically grouped. I'm not sure if or how these structures can be passed between your Fortran and Python code.
 
Last edited:
  • #11
rcgldr said:
Some groups treat programming more like a religion than a science, and it's in these groups where you find that globals and goto's are to be avoided.
I would argue that it is the other way around: It is those places that do not treat programming as a science where you find excessive use of globals and gotos. One of the earliest results of treating programming as a science was finding what aspects of programming were highly correlated with buggy code. Right at the top of the list are global variables and gotos. Organizations ban their use precisely because they treated programming as a science.
 
  • #12
D H said:
... global variables and gotos. Organizations ban their use precisely because they treated programming as a science.
Getting a bit off topic, goto's were discussed in this ancient thread:

https://www.physicsforums.com/showthread.php?t=308782

Similar type exceptions exist for the usage of globals. In a multi-threaded or multi-tasking environment, you have some variables, like message queues, that are global in nature. Constants or the nearly equivalent of variables that are initialized only one time are also valid candidates to be globals.
 
  • #13
Sort of new here and haven't dealt with Fortran for 25 years, but this is what I do in all my designs.

I have a class or structure called DirectiveInstructions, which captures all the config. It is called first when the main thread starts. It can do three things:

1. Read command line arguments
2. Read a config file
3. Read DB table

So now I can put all things, that I may want to fiddle with without re-compiling the program, in one or more of the above places. This class also contains all fixed variables, default values and strings I would use throughout.

A reference to this class is passed everywhere so every thread, object or function that needs "Directives" has them at hand. Nicely separates the config from your code and you only have one pointer to pass.
 

Related to How to deal with long argument list

1. How can I manage a long argument list in my scientific research?

One approach to dealing with a long argument list is to break it down into smaller, more manageable chunks. This can be achieved by grouping related arguments together or creating sub-functions to handle different parts of the argument list. Also, considering the use of data structures, such as arrays or objects, can help to organize and simplify the argument list.

2. What is the impact of having a long argument list on the efficiency of my research?

A long argument list can make your code more complex and difficult to understand, which can significantly impact the efficiency of your research. It can also lead to errors and make debugging more challenging. Therefore, it is essential to find ways to manage and simplify your argument list.

3. How can I determine if my argument list is too long?

There is no specific number of arguments that can be considered too long, as it depends on the complexity and purpose of your research. However, a good rule of thumb is to keep your argument list as short as possible while still maintaining the functionality and readability of your code.

4. What are some best practices for dealing with a long argument list?

Some best practices for managing a long argument list include keeping it well-organized and structured, using meaningful and consistent naming conventions, and avoiding unnecessary or redundant arguments. It is also helpful to regularly review and optimize your argument list as your research progresses.

5. How can I communicate my argument list effectively to my peers or collaborators?

When working with a long argument list, it is essential to provide clear and concise documentation for your code. This can include comments within the code itself, as well as a separate document outlining the purpose and usage of each argument. It is also helpful to discuss and review the argument list with your peers or collaborators to ensure mutual understanding and effective communication.

Similar threads

  • Programming and Computer Science
Replies
13
Views
1K
  • Programming and Computer Science
Replies
4
Views
2K
  • Programming and Computer Science
Replies
2
Views
2K
  • Programming and Computer Science
Replies
3
Views
1K
  • MATLAB, Maple, Mathematica, LaTeX
Replies
22
Views
3K
  • Programming and Computer Science
Replies
2
Views
2K
  • Programming and Computer Science
Replies
1
Views
3K
Replies
1
Views
1K
  • STEM Academic Advising
Replies
2
Views
882
Back
Top